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GLYCOSYL SULFOTRANSFERASE-3 

CROSS-REFERFNCK TO RELA TFD APPLICATIONS 
This application is a continuation-in-part of application serial no. 09/045,284 filed 
on March 20, 1 998, the disclosure of which is herein incorporated by reference. 



INTRODUCTION 

Field of the Invention 

The field of the invention is cell adhesion, particularly selectin mediated cell 
1 0 adhesion, as well as the treatment of disease conditions related thereto. 
O Background of the Invention 

n Sulfotransferases are enzymes that catalyze the transfer of a sulfate from a donor 

j " compound to an acceptor compound, usually placing the sulfate moiety at a specific 

location on the acceptor compound. There are a variety of different sulfotransferases 
1 5 which vary in activity, i.e. with respect to the donor and/or acceptor compounds with 
which they work. Known sulfotransferases include those acting on carbohydrate: 
heparin/heparan sulfate N-sulfotransferase (NST); chondroitin 6/keratan 6 sulfate 
sulfotransferase (C6ST/KSST); galactosylceramide 3'-sulfotransferase; heparan sulfate 2- 
sulfotransferase (Iduronic acid); HNK-1 sulfotransferase (3-glucuronic acid); heparan 
20 sulfate D-glucosamino 3-O-sulfotransferase (3-OST);etc, as well as those acting on 
phenols, steroids and xenobiotics: aryl sulfotransferase I & II, hydroxy-steroid 
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sulfotransferases I, II & III, dehydroepiandrosterone (DHEA); etc. Sulfotransferases play 
a central role in a variety of different biochemical mechanisms, as the presence of a 
sulfate moiety on a particular ligand is often required for a particular activity, e.g. binding. 
The presence of a sulfate moiety on selectin ligands has been shown to be 
5 important for selectin binding to occur. See Imai et al., Nature (1 993) 361:555-557 and 
Imai et al., Glycoconjugate J. (1993) 10:34-39, as well as U.S. Patent No. 5,695,752. 
Several selectin ligands have, to date, been identified. The L-selectin endothelial ligands 
in mouse that have been identified are: CD34, GlyCAM-1, MAdCAM-1 and sgp200. In 
addition, PSGL-1 has been identified as a leukocyte ligand for P-, E-, and L-selectin. 
10 Endothelial ligands for L-selectin in humans are still poorly defined, but include CD34 
and podocalyxin. 

Selectin mediated binding plays an important and prominent role in a variety of 
biological processes. Selectins are lectin like cell adhesion molecules that mediate 
leukocyte-endothelial, leukocyte-leukocyte, leukocyte-platelet, platelet-endothelial and 
2 15 platelet-platelet interactions. One critical biological process in which selectin mediated 
1 y binding plays a role is the maintenance of immune surveillance. 

I Maintenance of immune surveillance depends on the constant recirculation of 

lymphocytes from the blood through the vascular wall into the tissues and eventually back 

into the blood. Lymphocyte recruitment from the blood into all secondary lymphoid 

o 

M 20 organs (except the spleen) as well as into many sites of chronic inflammation is mediated 
by a specialized postcapillary venule called a high endothelial venule. These vessels are 
defined by the distinct, cuboidal morphology of their endothelial cells and their luminal 
presentation of ligands for the leukocyte adhesion molecule, L-selectin. This lectin-like 
adhesion molecule is expressed on all classes of leukocytes in the blood and is responsible 
25 for the initial tethering and rolling of a leukocyte on the endothelium prior to subsequent 
integrin mediated firm arrest and transmigration. 

Although selectin mediated binding events play a critical role in normal 
physiological processes, disease conditions do exist for which it is desired to regulate or 
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modulate, e.g. limit or prevent, the amount of selectin mediated binding that occurs. Such 
conditions include: acute or chronic inflammation; autoimmune and related disorders, 
tissue rejection during transplantation, and the like. 

As the above conditions all result from selectin mediated binding events, there is 
great interest in the elucidation of the mechanisms underlying such binding events. There 
is also great interest in the identification of treatment methodologies for these and related 
disease conditions, as well the identification of active agents for use therein. 

As such, there is continued interest in the identification of participants in the 
selectin binding mechanism, including enzymatic agents, and the elucidation of their 
role(s) in selectin mediated binding events, as well as the development of therapies for 
disease conditions arising from such binding events, 
^levant Literature 

Chondroitin-6-sulfotransferase is disclosed in EP 821 066, as well as in Fukuta et 
al., "Molecular Cloning and Characterization of Human Keratan Sulfate Gal-6- 
Sulfotransferase," J. Biol. Chem. (December 19, 1997) 272: 32321-32328; Habuchi et al., 
"Enzymatic Sulfation of Galactose Residue of Keratan Sulfate by Chondroitin 6- 
Sulfotransferase," Glycobiology (January 1996) 6:51-57; Habuchi et al., "Enzymatic 
Sulfation of Galactose Residue of Keratan Sulfate by Chondroitin 6-Sulfate by 
Chondroitin 6-Sulfotransferase," Glycobiology (January 1996) 6:51-57; Fukuta et al., 
"Molecular Cloning and Expression of Chick Chondrocyte Chondroitin 6- 
Sulfotransferase," J. Biol. Chem. (1995) 270: 18575-18580; and Habuchi et al., 
"Purification of Chondroitin 6-Sulfotransferase Secreted from Cultured Chick Embryo 
Chondrocytes," J. Biol. Chem. (1993) 268: 21968-21974. 

References providing background information on selectin mediated binding 
include: Baumhueter et al., "Binding of L-Selectin to the Vascular Sialomucin CD34," 
Science (October 15, 1993): 436-438; Boukerche et al., "A Monoclonal Antibody 
Directed Against a Granule Membrane Glycoprotein (GMP-1 40/PADGEM, P-selectin, 
CD62P) Inhibits Ristocetin-Induced Platelet Aggregation," Br. J. Haematology (1996) 92 
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442-451; Celi et al., "Platelet-Leukocyte-Endothelial Cell Interaction on the Blood Vessel 
Wall," Seminars in Hematology (1997) 34: 327-335; Frenette et al.. "Platelets Roll on 
, Stimulated Endothelium In Vivo: An Interaction Mediated by Endothelial P-selectin," 

Proc. Natl. Acad. Sci. USA (August 1995) 52:7450-7454; Girard & Springer, "High 
5 Endothelial Venules (HEVs): Specialized Endothelium for Lymphocyte Migration," 
Immun. Today (1995) 16: 449-457; Hemmerich et al., "Sulfation Dependent Recognition 
of High Endothelial Venules (HEV)-Ligands by L-Selectin and Meca79, and Adhesion- 
Blocking Monoclonal Antibody," J. Exp. Medicine (December 1994) 180: 2219-2226; 
262 Lasky et al , "An Endothelial Ligand for L-Selectin Is a Novel Mucin-Like 
10 Molecule," Cell (June 12, 1992) 69:927-938; Rosen & Bertozzi, "The Selectins and Their 
Ligands," Current Opinion in Cell Biology (1994) 6: 663-673; and Sawada et al., 
H "Specific Expression of a Complex Sialyl Lewis X Antigen On High Endothelial Venules 

§ of Human Lymph Nodes: Possible Candidate for L-selectin Ligand," Biochem. Biophys: 

p Res. Comm. (May 28, 1993) 193: 337-347; as well as U.S. Patent No. 5,580,862. 

U.S. Patent No. 5,695,752 describes methods of treating inflammation through 
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administration of sulfation inhibitors. 



SUMMARY OF THE INVENTIO N 
A novel human glycosyl sulfotransferase (GST-3 or HEC-GlcNAc6ST) and 
20 polypeptides related thereto, as well as nucleic acid compositions encoding the same, are 
provided. The subject polypeptide and nucleic acid compositions find use in a variety of 
applications, including research, diagnostic, and therapeutic agent screening applications, 
as well as in treatment therapies. Also provided are methods of inhibiting selectin 
mediated binding events and methods of treating disease conditions associated therewith, 
25 particularly by administering inhibitors of GST-3 and/or KSGal6ST (or an HEC specific 
homologue thereof) 
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ppp y BE5C&IEH QM ™ THF FIGURES 
Fig. 1 provides the cDNA sequence and amino acid sequence of Human GST-3. 
The cDNA sequence is SEQ ID NO:02 and the amino acid sequence is SEQ ID NO:01 . 
Fig. 2 provides the hydropathy plot for GST-3 (HEC-GlcNAc6ST). The 
5 hydrophobic^ was calculated by the method of Kyte and Doolittle J. Mol. Biol. (1982) 
157, 105-32; using a window of 1 1 amino acids. 

Fig. 3 provides a representation of the derivation of primers that were employed in 
homology PCR on HEV-cDNA. Peptide sequences GST-1 and GST-2 were aligned with 
several known sulfotransferases retrieved from the Genbank database using the ClustalW 
10 (Thompson et aL, Nuc. Acids Res. (1994) 22: 4673-4680) algorithm. Three consensus 
regions A, B, & C were identified. These are depicted above in order 5' to 3' (N-terminal 
to C-terminal) with the intermittent and flanking regions of less or no homology 
truncated. Highly degenerate forward primers A+ and B+ and reverse primers B- and O 
containing toggle bases as well as inosine were designed as decribed in the experimental 
15 section. These primers encode a maximal number of permutations at the amino-acid level 
in order to accomodate most patterns highlighted above in red or brown. 

Fig. 4 provides a representation of the protein domain structure of C6ST/KSST 

and GST-1, GST-2 and GST-3. 

Fig. 5 provides the alignment of regions of high conservation among human 
20 carbohydrate 6-sulfotransferases (GaI-6, GlcNAc-6 and GalNAc-6). Protein sequences 
were aligned using the ClustalW algorithm (Thompson et aL, 1994, supra). Black shading 
indicates identity at that residue among at least three of the sequences, grey shading 

indicates similarity. 

Fig. 6 provides a graphical representation of the amount of sulfate (S0 4) 
25 incorporation into GlyCAM-1 in COS7 cells cotransfected with: (a) GlyCAM-l/Ig 
expression plasmid plus GST-3 expression plasmid; and (b) GlyCAM-l/Ig expression 
plasmid plus vector lacking GST-3 insert; and (c) untransfected cells. 



UC Ref: 98-299-2 
B&RRef:6510-107CIP 

Roche Ref: R0040A-ORD .5. 

F:\DOCUMENT\6S10 • UCALM07CIP*«em vplkauon-wpd 



Fig. 7 provides the sulfated O-linked carbohydrate chains of GlyCAM-1. 
Oligosaccharides bearing the 6-sulfo sialyl Lewis x (sLe x ) and the 6'-sulfo sLe x motif, 
which extend from the core 2 structure are shown. The presence of the 6',6-disulfo sLe x 
motif is strongly suspected. Structures of the more complex O-linked chains of GlyCAM- 
1 remain to be determined. 

Fig. 8 provides a graphical representation of the results of an assay designed to 
study the sulfation of synthetic acceptors by HEC-GlcNAc6ST. Microsomal extracts from 
COS cells transfected with HEC-GlcNAc6ST cDNA or vector cDNA (mock) were reacted 
with [ 35 S]-PAPS and the disaccharide or trisaccharide acceptors (see Table 3). After a 2 
hour incubation at 37°C, radiolabeled acceptor was isolated by reversed-phase 
chromatography and incorporation was quantified by liquid scintillation counting. Each 
datapoint represents the average of triplicate determinations. The error bars indicate 
SEMs. 

Fig. 9 provides the results obtained when the disaccharide acceptor sulfated by 
extracts of HEC-GlcNAc6ST transfectants (see A) was subjected to mild acid hydrolysis. 
The hydrolysate was analysed by high pH anion exchange chromatography using the 
following standards: 1, GlcNAc-3-S0 3 -; 2, [ 35 S]-S0 4 2 -; 3, GaM-SOf; 4, Gal-3-S0 3 -; 5, 
GlcNAc-6-S0 3 -; 6, Gal-6-S0 3 -. The hydrolysate from the disaccharide acceptor sulfated 
by HEC-GlcNAc6ST transfectant microsomes (-a-) showed only two radioactive peaks, 
that comigrated with free sulfate (left peak) and GlcNAc-6-sulfate (right peak). 
Hydrolysate from disaccharide acceptor sulfated by mock transfectant microsomes 
(—0— ) showed only free sulfate. 

Fig. 10 provides the results of an assay designed to study the sulfation of 
GlyCAM-l/IgG conferred by HEC-GlcNAc6ST and KSGal6ST cDNAs. COS cells were 
transfected with combinations of plasmids encoding GlyCAM-l/IgG and the two 
sulfotransferases as indicated. Transfected cells were cultured in the presence of [ 35 S]- 
sulfate and recombinant GlyCAM-l/IgG was isolated from the conditioned medium by 
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passage over protein A-agarose. 1% of the captured material was subjected to SDS- 
PAGE and the remainder was processed for analysis of sulfated carbohydrate. 
Densitometric quantification of the Coomassie Blue-stained bands following SDS-PAGE 
showed that each lane, except the control lane without GlyCAM-l/IgG plasmid, contained 

5 approximately equal amounts of GlyCAM-l/IgG. Analysis of sulfated carbohydrate in 
GlyCAM-l/IgG coexpressed with HEC-GlcNAc6ST or KSGal6ST is provided in the 
graph of Fig. 10. Sulfated mono- and disaccharides derived from [ 35 S]-sulfate labeled 
GlyCAM-l/IgG produced by COS cells transfected with HEC-GlcNAc6ST 
(_ A _) or KSGal6ST (--#--), as described above, were analyzed by high pH anion 

10 exchange chromatography after acid hydrolysis. The standards were as follows: 1, 
GlcNAc-3S0 3 -; 2, [ 35 S]-S0 4 2 "; 3, Gaipi-4[S0 3 --6]GlcNAc; 4, 

[S0 3 --6]Gaipi-4GlcNAc; 5, Gal-4S0 3 ~; 6, Gal-3S0 3 -; 7, GlcNAc-6S0 3 -; 8, Gal-6S0 3 ". 

Figs. 1 1A to 1 1C provides the results of an assay in which CHO/FTVII/C2GnT 
cells were transfected with different combinations of cDNAs encoding human CD34, 

15 HEC-GlcNAc6ST and KSGal6ST. cDNAs encoding FTVII and C2GnT were included in 
all transfection mixtures to ensure sufficient expression levels of these enzymes. Cells 
were stained with the L-selectin/IgM chimera to detect ligand activity. Figs 1 1 A and 1 1 B 
are histograms showing L-selectin/IgM staining for the following transfections: CD34 
cDNA only (— , A and B); CD34/HEC-GlcNAc6ST/KSGal6ST cDNAs (— , A and B); 

20 CD34/HEC-GlcNAc6ST/KSGal6ST cDNAs with staining done in the presence of Mel- 
14 mAb (— , A); HEC-GlcNAc6/KSGal6ST cDNAs but no CD34 cDNA (— , B). The 
secondary staining reagent when used alone showed staining equal to that observed when 
Mel-14 mAb was used as an inhibitor. The isotype-matched control for Mel-14 showed 
no inhibitory effect. Fig 1 1C provides a the results of a two color analysis showing CD34 

25 expression (y-axis, staining with CD34-PE mAb) and L-selectin ligand activity (x-axis, 
FITC) in cells transfected with CD34 cDNA and cDNAs encoding HEC-GlcNAc6ST and 
KSGal6ST, alone or in combination as indicated. The horizontal bar is set such that all 
cells staining with the isotype matched control for the CD34 mAb are included in the 
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lower quadrants. The vertical bar is set to indicate the L-selectin/IgM staining of cells in 
which no sulfotransferase cDNA was included in the transfection mixture (lower left 
panel). The fraction of positive cells (as a percentage of the total) in each quadrant is 
indicated. The mean fluorescence intensity (MFI) for the cells in the upper right 
5 quadrants is indicated. The CD34 mAb did not interfere with staining by the L- 

selectin/IgM chimera. 

Figs 12 to 14 provide the results from rolling and tethering experiments which 
were conducted to elucidate the role of GST-3 and KSGal6ST in the biosynthesis of 
functional L-selectin ligands, as described in greater detail in the experimental section 
10 infra. 

\ t DETAILED DEST PIPTTON OF THF, INVENTION 

M ■ 

A novel human glycosyl transferase expressed in high endothelial cells (HEC) (i.e. 
15 GST-3 or HEC-GlcNAc6ST) and polypeptides related thereto, as well as nucleic acid 
compositions encoding the same, are provided. The subject polypeptide and/or nucleic 
acid compositions find use in a variety of different applications, including research, 
£ diagnostic, and therapeutic agent screening/discovery/ preparation applications. Also 

provided are methods of inhibiting selectin mediated binding events and methods of 
20 treating disease conditions associated therewith, particularly by administering an inhibitor 
of GST-3 and/or KSGal6ST, or an HEC specific homologue thereof. 

Before the subject invention is further described, it is to be understood that the 
invention is not limited to the particular embodiments of the invention described below, as 
25 variations of the particular embodiments may be made and still fall within the scope of the 
appended claims. It is also to be understood that the terminology employed is for the 
purpose of describing particular embodiments, and is not intended to be limiting. Instead, 
the scope of the present invention will be established by the appended claims. 
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In this specification and the appended claims, the singular forms "a," "an," and 
"the" include plural reference unless the context clearly dictates otherwise. Unless defined 
otherwise, all technical and scientific terms used herein have the same meaning as 
commonly understood to one of ordinary skill in the art to which this invention belongs. 

5 

Polypeptide Compositions 

A novel human glycosylsulfotransferase expressed in high endothelial cells 
(HEC), as well as polypeptide compositions related thereto, are provided. The term 
10 polyeptide composition as used herein refers to both the full length human protein as well 
as portions or fragments thereof. Also included in this term are variations of the naturally 
U occurring human protein, where such variations are homologous or substantially similar to 

the naturally occurring protein, as described in greater detail below, as well as 
corresponding homologs from non-human species, such as other mammalian species. In 
m 15 the following description of the subject invention, the terms GST-3 and HEC-GlcNAc6ST 
are used to refer not only to the human form of this novel sulfotransferase, but also to 
homologs thereof expressed in non-human species. 

The novel human glycosylsulfotransferase enzyme of the subject invention has 
| been named human glycosyl sulfotransferase 3 or huGST-3 or HEC-GlcNAc6ST. huGST- 

|3 20 3 is a type 2 membrane protein having a relatively short transmembrane domain and a 

short ammo-terminal cytoplasmic tail. huGST-3 has a 31 % amino acid sequence identity 
with CS6T/KSST (Habuchi et al., J. Biol. Chem. (1995) 240:4172-4179) as measured by 
using the "GAP" program (part of the Wisconsin Sequence Analysis Package available 
through the Genetics Computer Group, Inc. (Madison WI)), where the parameters are: 
25 Gap weight: 12; length weight:4. huGST-3 is capable of sulfating selectin ligands, 

particularly L-selectin ligands, e.g. GlyCAM-1. By sulfating selectin ligands is meant that 
huGST-3 is capable of catalyzing the transfer of a sulfate group from a donor compound 
to a position on a selectin ligand precursor as acceptor compound. Donor compounds 
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from which huGST-3 obtains sulfate groups for transfer to acceptor ligand compounds 
include 3-phosphoadenosine 5'-phosphosulfate (PAPS) and the like. Selectin ligands 
capable of being sulfated through huGST-3 action include E-, P- and L-selectin ligands, 
particularly L-selectin ligands, such as GlyCAM-1, CD34, MAdCAM-1, Sgp200, 
5 podocalyxin, and the like. huGST-3 is strongly predicted to have GlcNAc6-0- 
sulfotransferase (N-actylglucosamine-6-O-sulfotransferase) activity. 

Human GST-3 is a 386 amino acid protein having an amino acid sequence as 
shown in Fig. 1 and identified as SEQ ID NO:01 . huGST-3 has a molecular weight based 
on its amino acid of about 45 kDa to 46 kDa, and more specifically from about 45100 to 
1 0 45200 dalton, and specifically 45 1 04 dalton (using DNA Strider 1 .2 software). Since 

GST-3 is glycosylated, its true molecular weight is greater, and is likely to be in the range 
from about 45 to 85kDa, and more likely from about 50 kDa to 65 kDa. Expression of 
GST-3 in humans is highly restricted. For example, huGST-3 is expressed in HEC but not 
tonsillar lymphocytes, or primary cultured human umbilical vein endothelial cells 
S 15 (HUVEC). 

ry In addition to the huGST-3, also provided are GST-3 proteins that have the same 

expression pattern in humans as huGST-3, i.e. are highly restricted and expressed in HEC 
but not HUVEC or lymphocytes. huGST-3 homologs or proteins (or fragments thereof) 
from nonhuman species are also provided, including mammals, such as: rodents, e.g. 
20 mice, rats; domestic animals, e.g. horse, cow, dog, cat; and humans, as well as non- 
mammalian species, e.g. avian, and the like. By homolog is meant a protein having at 
least about 35 %, usually at least about 40% and more usually at least about 60 % amino 
acid sequence identity to the huGST-3 protein. 

Also provided are GST-3 proteins that are substantially identical to the huGST-3 
25 protein, where by substantially identical is meant that the protein has an amino acid 

sequence identity to the sequence of huGST-3 of at least about 35%, usually at least about 
40% and more usually at least about 60 %. 
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Also provided are KSGal6ST homologues that are selectively expressed in HEC. 
The nucleotide and amino acid sequence for KSGal6ST is reported in Fukuta et at, J. 
Biol. Chem. (December 19, 1997) 272:32321-32328. The subject HEC specific 
KSGal6ST homologues have a sequence that is substantially identical to KSGal6ST, 
where by substantially identical is meant that the protein has an amino acid sequence 
identity to the sequence of KSGal6ST of at least about 35%, usually at least about 40% 
and more usually at least about 60 %. 

The proteins of the subject invention (e.g. huGST-3 or a homolog thereof; an HEC 
specific KSGal6ST homologue) are present in a non-naturally occurring environment, e.g. 
are separated from their naturally occurring environment. In certain embodiments, the 
subject proteins are present in a composition that is enriched for subject protein as 
compared to its naturally occurring environment. For example, purified GST-3 is 
provided, where by purified is meant that GST-3 is present in a composition that is 
substantially free of non-GST-3 proteins, where by substantially free is meant that less 
than 90 %, usually less than 60 % and more usually less than 50 % of the composition is 
made up of non-GST-3 proteins. The proteins of the subject invention may also be present 
as an isolate, by which is meant that the protein is substantially free of other proteins and 
other naturally occurring biologic molecules, such as oligosaccharides, polynucleotides 
and fragments thereof, and the like, where substantially free in this instance means that 
less than 70 %, usually less than 60% and more usually less than 50 % of the composition 
containing the isolated protein is some other naturally occurring biological molecule. In 
certain embodiments, the proteins are present in substantially pure form, where by 
substantially pure form is meant at least 95%, usually at least 97% and more usually at 
least 99% pure. 

In addition to the naturally occurring proteins, polypeptides which vary from the 
naturally occurring proteins are also provided, e.g. GST-3 polypeptides. By GST-3 
polypeptide is meant an amino acid sequence encoded by an open reading frame (ORF) of 
the GST-3 gene, described in greater detail below, including the full length GST-3 protein 
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and fragments thereof, particularly biologically active fragments and/or fragments 
corresponding to functional domains, e.g. acceptor binding site (postulated to be the most 
5' consensus region A (see experimental section infra), the donor binding site, e.g. 
VRYEDL, and the like; and including fusions of the subject polypeptides to other proteins 
or parts thereof. Fragments of interest will typically be at least about 10 aa in length, 
usually at least about 50 aa in length, and may be as long as 300 aa in length or longer, but 
will usually not exceed about 1000 aa in length, where the fragment will have a stretch of 
amino acids that is identical to the subject protein of at least about 10 aa, and usually at 
least about 15 aa, and in many embodiments at least about 50 aa in length. 

The subject proteins and polypeptides may be obtained from naturally occurring 
sources or synthetically produced. Where obtained from naturally occurring sources, the 
source chosen will generally depend on the species from which the protein is to be 
derived. For example, huGST-3 is generally derived from endothelial cells of high 
endothelial venules (HEV) of human secondary lymphoid organs, such as tonsils. The 
subject proteins may also be derived from synthetic means, e.g. by expressing a 
recombinant gene encoding protein of interest in a suitable host, as described in greater 
detail below. Any convenient protein purification procedures may be employed, where 
suitable protein purification methodologies are described in Guide to Protein Purification, 
(Deuthser ed.) (Academic Press, 1990). For example, a lysate may prepared from the 
original source, e.g. HEC or the expression host, and purified using HPLC, exclusion 
chromatography, gel electrophoresis, affinity chromatography, and the like. 

Nucleic Acid Compositions 

Also provided are nucleic acid compositions encoding GST-3 proteins or 
fragments thereof, as well as the HEC-specific KSGal6ST homologues of the present 
invention. By GST-3 nucleic acid composition is meant a composition comprising a 
sequence of DNA having an open reading frame that encodes GST-3, i.e. a GST-3 gene, 
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and is capable, under appropriate conditions, of being expressed as GST-3. Also 
encompassed in this term are nucleic acids that are homologous or substantially similar or 
identical to the nucleic acids encoding GST-3 proteins. Thus, the subject invention 
provides genes encoding huGST-3 and homologs thereof. The human GST-3 gene has the 
5 nucleic acid sequence shown in Fig. 1 and identified as SEQ ID NO:02, infra. 

The source of homologous genes may be any species, e.g., primate species, 
particularly human; rodents, such as rats and mice, canines, felines, bovines, ovines, 
equines, yeast, nematodes, etc. Between mammalian species, e.g., human and mouse, 
homologs have substantial sequence similarity, e.g. at least 75% sequence identity, 
10 usually at least 90%, more usually at least 95% between nucleotide sequences. Sequence 
similarity is calculated based on a reference sequence, which may be a subset of a larger 
! J sequence, such as a conserved motif, coding region, flanking region, etc. A reference 

D sequence will usually be at least about 1 8 nt long, more usually at least about 30 nt long, 

5 and may extend to the complete sequence that is being compared. Algorithms for 

fi 1 5 sequence analysis are known in the art, such as BLAST, described in Altschul et al. 
j'J (1990), J. Mol. Biol. 215:403-10 (using default settings). The sequences provided herein 

U are essential for recognizing GSr-3-related and homologous proteins in database searches. 

K Nucleic acids encoding the GST-3 protein and GST-3 polypeptides of the subject 

2 invention may be cDNA or genomic DNA or a fragment thereof. The term "GST-3 gene" 

20 shall be intended to mean the open reading frame encoding specific GST-3 proteins and 
polypeptides, and GST-3 introns, as well as adjacent 5' and 3' non-coding nucleotide 
sequences involved in the regulation of expression, up to about 20 kb beyond the coding 
region, but possibly further in either direction. The gene may be introduced into an 
appropriate vector for extrachromosomal maintenance or for integration into a host 
25 genome. 

The term "cDNA" as used herein is intended to include all nucleic acids that share 
the arrangement of sequence elements found in native mature mRNA species, where 
sequence elements are exons and 3' and 5' non-coding regions. Normally mRNA species 
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have contiguous exons, with the intervening introns, when present, being removed by 
nuclear RNA splicing, to create a continuous open reading frame encoding a GST-3 
protein. 

A genomic sequence of interest comprises the nucleic acid present between the 
initiation codon and the stop codon, as defined in the listed sequences, including all of the 
introns that are normally present in a native chromosome. It may further include the 3 ' 
and 5' untranslated regions found in the mature mRNA. It may further include specific 
transcriptional and translational regulatory sequences, such as promoters, enhancers, etc., 
including about 1 kb, but possibly more, of flanking genomic DNA at either the 5 'or 3' 
end of the transcribed region. The genomic DNA may be isolated as a fragment of 100 
kbp or smaller; and substantially free of flanking chromosomal sequence. The genomic 
DNA flanking the coding region, either 3' or 5', or internal regulatory sequences as 
sometimes found in introns, contains sequences required for proper tissue and stage 
specific expression. 

The nucleic acid compositions of the subject invention may encode all or a part of 
the subject GST-3 protein. Double or single stranded fragments may be obtained from the 
DNA sequence by chemically synthesizing oligonucleotides in accordance with 
conventional methods, by restriction enzyme digestion, by PCR amplification, etc. For 
the most part, DNA fragments will be of at least 15 nt, usually at least 18 nt or 25 nt, and 

may be at least about 50 nt. 

The GST-3 genes are isolated and obtained in substantial purity, generally as other 
than an intact chromosome. Usually, the DNA will be obtained substantially free of other 
nucleic acid sequences that do not include a GST-3 sequence or fragment thereof, 
generally being at least about 50%, usually at least about 90% pure and are typically 
"recombinant", i.e. flanked by one or more nucleotides with which it is not normally 
associated on a naturally occurring chromosome. 



Preparation of GST-3 Polypeptides 
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In addition to the plurality of uses described in greater detail in following sections, 
the subject nucleic acid compositions find use in the preparation of all or a portion of the 
GST-3 polypeptides, as described above. For expression, an expression cassette may be 
employed. The expression vector will provide a transcriptional and translation^ initiation 
region, which may be inducible or constitutive, where the coding region is operably linked 
under the transcriptional control of the transcriptional initiation region, and a 
transcriptional and translational termination region. These control regions may be native 
to a GST-3 gene, or may be derived from exogenous sources. 

Expression vectors generally have convenient restriction sites located near the 
promoter sequence to provide for the insertion of nucleic acid sequences encoding 
heterologous proteins. A selectable marker operative in the expression host may be 
present. Expression vectors may be used for the production of fusion proteins, where the 
exogenous fusion peptide provides additional functionality, i.e. increased protein 
synthesis, stability, reactivity with defined antisera, an enzyme marker, e.g. 
-galactosidase, etc. 

Expression cassettes may be prepared comprising a transcription initiation region, 
the gene or fragment thereof, and a transcriptional termination region. Of particular 
interest is the use of sequences that allow for the expression of functional epitopes or 
domains, usually at least about 8 amino acids in length, more usually at least about 15 
amino acids in length, to about 25 amino acids, and up to the complete open reading 
frame of the gene. After introduction of the DNA, the cells containing the construct may 
be selected by means of a selectable marker, the cells expanded and then used for 
expression. 

GST-3 proteins and polypeptides may be expressed in prokaryotes or eukaryotes 
in accordance with conventional ways, depending upon the purpose for expression. For 
large scale production of the protein, a unicellular organism, such as E. coli, B. subtilis, 
S. cerevisiae, insect cells in combination with baculovirus vectors, or cells of a higher 
organism such as vertebrates, particularly mammals, e.g. COS 7 cells, may be used as the 
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expression host cells. In some situations, it is desirable to express the GST-3 gene in 
eukaryotic cells, where the GST-3 protein will benefit from native folding and post- 
translational modifications. Small peptides can also be synthesized in the laboratory. 
Polypeptides that are subsets of the complete GST-3 sequence may be used to identify and 
5 investigate parts of the protein important for function. 

Uses of the Subject GST-3 Polypeptide and Nucleic Acid Compositions 

The subject polypeptide and nucleic acid compositions find use in a variety of 
10 different applications, including research, diagnostic, and therapeutic agent 

screening/discovery/ preparation applications, as well as therapeutic compositions. 



w Research Applications 

Q 

si 

jjj 15 The subject nucleic acid compositions find use in a variety of research 

m applications. Research applications of interest include: the identification of huGST-3 

U homologs; as a source of novel promoter elements; the identification of GST-3 expression 

P regulatory factors; as probes and primers in hybridization applications, e.g. PCR; the 

2 identification of expression patterns in biological specimens; the preparation of cell or 

H 20 animal models for GST-3 function; the preparation of in vitro models for GST-3 function; 
etc. 

Homologs of GST-3 are identified by any of a number of methods. A fragment of 
the provided cDNA may be used as a hybridization probe against a cDNA library from the 
target organism of interest, where low stringency conditions are used. The probe may be 
25 a large fragment, or one or more short degenerate primers. Nucleic acids having sequence 
similarity are detected by hybridization under low stringency conditions, for example, at 
50°C and 6XSSC (0.9 M sodium chloride/0.09 M sodium citrate) and remain bound when 
subjected to washing at 55°C in 1XSSC (0.15 M sodium chloride/0.015 M sodium 
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citrate). Sequence identity may be determined by hybridization under stringent 
conditions, for example, at 50°C or higher and 0.1XSSC (15 mM sodium chloride/01.5 
mM sodium citrate). Nucleic acids having a region of substantial identity to the provided 
GST-3 sequences, e.g. allelic variants, genetically altered versions of the gene, etc., bind 
5 to the provided GST-3 sequences under stringent hybridization conditions. By using 
probes, particularly labeled probes of DNA sequences, one can isolate homologous or 
related genes. 

The sequence of the 5' flanking region may be utilized for promoter elements, 
including enhancer binding sites, that provide for developmental regulation in tissues 
1 0 where GST-3 is expressed. The tissue specific expression is useful for determining the 
pattern of expression, and for providing promoters that mimic the native pattern of 
t expression. Naturally occurring polymorphisms in the promoter region are useful for 

5 determining natural variations in expression, particularly those that may be associated 

sj with disease. 

if 

1 5 Alternatively, mutations may be introduced into the promoter region to determine 

^ the effect of altering expression in experimentally defined systems. Methods for the 

identification of specific DNA motifs involved in the binding of transcriptional factors are 
known in the art, e.g. sequence similarity to known binding motifs, gel retardation studies, 
etc. For examples, see Blackwell et al. (1995), Mol Med. 1:194-205; Mortlock et al. 
|=a 20 (1996), Genome Res. 6:327-33; and Joulin and Richard-Foy (1995), Eur. J. Biochem. 
232:620-626. 

The regulatory sequences may be used to identify cis acting sequences required for 
transcriptional or translational regulation of GST-3 expression, especially in different 
tissues or stages of development, and to identify cis acting sequences and /raws-acting 
25 factors that regulate or mediate GST-3 expression. Such transcription or translational 

control regions may be operably linked to a GST-3 gene in order to promote expression of 
wild type or altered GST-3 or other proteins of interest in cultured cells, or in embryonic, 
fetal or adult tissues, and for gene therapy. 
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Small DNA fragments are useful as primers for PCR, hybridization screening 
probes, etc. Larger DNA fragments, i.e. greater than 100 nt are useful for production of 
the encoded polypeptide, as described in the previous section. For use in amplification 
reactions, such as PCR, a pair of primers will be used. The exact composition of the 

5 primer sequences is not critical to the invention, but for most applications the primers will 
hybridize to the subject sequence under stringent conditions, as known in the art. It is 
preferable to choose a pair of primers that will generate an amplification product of at 
least about 50 nt, preferably at least about 100 nt. Algorithms for the selection of primer 
sequences are generally known, and are available in commercial software packages. 

10 Amplification primers hybridize to complementary strands of DNA, and will prime 

towards each other. 

The DNA may also be used to identify expression of the gene in a biological 
specimen. The manner in which one probes cells for the presence of particular nucleotide 
sequences, as genomic DNA or RNA, is well established in the literature. Briefly, DNA 

15 or mRNA is isolated from a cell sample. The mRNA may be amplified by RT-PCR, 

using reverse transcriptase to form a complementary DNA strand, followed by polymerase 
chain reaction amplification using primers specific for the subject DNA sequences. 
Alternatively, the mRNA sample is separated by gel electrophoresis, transferred to a 
suitable support, e.g. nitrocellulose, nylon, etc., and then probed with a fragment of the 

20 subject DNA as a probe. Other techniques, such as oligonucleotide ligation assays, in situ 
hybridizations, and hybridization to DNA probes arrayed on a solid chip may also find 
use. Detection of mRNA hybridizing to the subject sequence is indicative of GST-3 gene 

expression in the sample. 

The sequence of a GST-3 gene, including flanking promoter regions and coding 
25 regions, may be mutated in various ways known in the art to generate targeted changes in 
promoter strength, sequence of the encoded protein, etc. The DNA sequence or protein 
product of such a mutation will usually be substantially similar to the sequences provided 
herein, i.e. will differ by at least one nucleotide or amino acid, respectively, and may 
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differ by at least two but not more than about ten nucleotides or amino acids. The 
sequence changes may be substitutions, insertions, deletions, or a combination thereof. 
Deletions may further include larger changes, such as deletions of a domain or exon. 
Other modifications of interest include epitope tagging, e.g. with the FLAG system, HA, 
etc. For studies of subcellular localization, fusion proteins with green fluorescent proteins 

(GFP) may be used. 

Techniques for in vitro mutagenesis of cloned genes are known. Examples of 
protocols for site specific mutagenesis may be found in Gustin et al. (1 993), 
Biotechniques 14:22; Barany (1985), Gene 37:1 1 1-23; Colicelli et al. (1985), Mol. Gen. 
Genet. 199:537-9; and Prentki et al. (1984), Gene 29:303-13. Methods for site specific 
mutagenesis can be found in Sambrook et al, Molecular Cloning: A Laboratory Manual, 
CSH Press 1989, pp. 15.3-15.108; Weiner et al. (1993), Gene 126:35-41; Sayers et al. 
(1992), Biotechniques 13:592-6; Jones and Winistorfer (1992), Biotechniques 12:528-30; 
Barton et al. (1990), Nucleic Acids Res 18:7349-55; Marotti and Tomich (1989), Gene 
Anal. Tech. 6:67-70; and Zhu (1989), Anal Biochem 177:120-4. Such mutated genes may 
be used to study structure-function relationships of GST-3, or to alter properties of the 
protein that affect its function or regulation. 

The subject nucleic acids can be used to generate transgenic, non-human animals 
or site specific gene modifications in cell lines. Transgenic animals may be made through 
homologous recombination, where the normal gst-3 locus is altered. Alternatively, a 
nucleic acid construct is randomly integrated into the genome. Vectors for stable 
integration include plasmids, retroviruses and other animal viruses, YACs, and the like. 

The modified cells or animals are useful in the study of gst-3 function and 
regulation. For example, a series of small deletions and/or substitutions may be made in 
the host's native gst-3 gene to determine the role of different exons in oncogenesis, signal 
transduction, etc. Of interest are the use of gst-3 to construct transgenic animal models 
for cancer, where expression of GST-3 is specifically reduced or absent Specific 
constructs of interest include anti-sense gst-3, which will block GST-3 expression, 
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expression of dominant negative gst-3 mutations, and over-expression of GST-3 genes. 
Where a gst-3 sequence is introduced, the introduced sequence may be either a complete 
or partial sequence of a gst-3 gene native to the host, or may be a complete or partial gst^3 
sequence that is exogenous to the host animal, e.g., a human GST-3 sequence. A 
5 detectable marker, such as lac Z may be introduced into the gst-3 locus, where 

upregulation of gst-3 expression will result in an easily detected change in phenotype. 

One may also provide for expression of the gst-3 gene or variants thereof in cells 
or tissues where it is not normally expressed, at levels not normally present in such cells 
or tissues, or at abnormal times of development. 
1 0 DNA constructs for homologous recombination will comprise at least a portion of 

the human GST-3 gene or of a gst-3 gene native to the species of the host animal, wherein 
the gene has the desired genetic modification(s), and includes regions of homology to the . 

5 target locus. DNA constructs for random integration need not include regions of 

O 

"4 homology to mediate recombination. Conveniently, markers for positive and negative 

1 5 selection are included. Methods for generating cells having targeted gene modifications 
through homologous recombination are known in the art. For various techniques for 
I"* transfecting mammalian cells, see Keown et al. (1990), Meth. Enzymol. 185:527-537. 

For embryonic stem (ES) cells, an ES cell line may be employed, or embryonic 
cells may be obtained freshly from a host, e.g. mouse, rat, guinea pig, etc. Such cells are 
20 grown on an appropriate fibroblast-feeder layer or grown in the presence of leukemia 
inhibiting factor (LIF). When ES or embryonic cells have been transformed, they may be 
used to produce transgenic animals. After transformation, the cells are plated onto a 
feeder layer in an appropriate medium. Cells containing the construct may be detected by 
employing a selective medium. After sufficient time for colonies to grow, they are picked 
25 and analyzed for the occurrence of homologous recombination or integration of the 

construct. Those colonies that are positive may then be used for embryo manipulation and 
blastocyst injection. Blastocysts are obtained from 4 to 6 week old superovulated 
females. The ES cells are trypsinized, and the modified cells are injected into the 
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blastocoel of the blastocyst. After injection, the blastocysts are returned to each uterine 
horn of pseudopregnant females. Females are then allowed to go to term and the resulting 
offspring screened for the construct. By providing for a different phenotype of the 
blastocyst and the genetically modified cells, chimeric progeny can be readily detected. 
5 The chimeric animals are screened for the presence of the modified gene and 

males and females having the modification are mated to produce homozygous progeny. If 
the gene alterations cause lethality at some point in development, tissues or organs can be 
maintained as allogeneic or congenic grafts or transplants, or in in vitro culture. The 
transgenic animals may be any non-human mammal, such as laboratory animals, domestic 
1 0 animals, etc. The transgenic animals may be used in functional studies, drug screening, 
etc., e.g. to determine the effect of a candidate drug on GST-3 activity. 

The availability of a number of components in the leukocyte trafficking 
mechanism, such as GlyCAM-1, L-selectin and the subject GST-3 enzyme, and the like/ 
allows in vitro reconstruction of the mechanism, i.e. the production of an in vitro model. 



Diagnostic Applications 



Also provided are methods of diagnosing disease states based on observed levels 
of GST-3 or the expression level of the GST-3 gene in a biological sample of interest. 

20 Samples, as used herein, include biological fluids such as blood, cerebrospinal fluid, tears, 
saliva, lymph, dialysis fluid, semen and the like; organ or tissue culture derived fluids; 
and fluids extracted from physiological tissues. Also included in the term are derivatives 
and fractions of such fluids. The cells may be dissociated, in the case of solid tissues, or 
tissue sections may be analyzed. Alternatively a lysate of the cells may be prepared. 

25 A number of methods are available for detennining the expression level of a gene 

or protein in a particular sample. Diagnosis may be performed by a number of methods to 
determine the absence or presence or altered amounts of normal or abnormal GST-3 in a 
patient sample. For example, detection may utilize staining of cells or histological 
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sections with labeled antibodies, performed in accordance with conventional methods. 
Cells are permeabilized to stain cytoplasmic molecules. The antibodies of interest are 
added to the cell sample, and incubated for a period of time sufficient to allow binding to 
the epitope, usually at least about 10 minutes. The antibody may be labeled with 
5 radioisotopes, enzymes, fluorescers, chemiluminescers, or other labels for direct detection. 
Alternatively, a second stage antibody or reagent is used to amplify the signal. Such 
reagents are well known in the art. For example, the primary antibody may be conjugated 
to biotin, with horseradish peroxidase-conjugated avidin added as a second stage reagent. 
Alternatively, the secondary antibody conjugated to a flourescent compound, e.g. 
1 0 fluorescein, rhodamine, Texas red, etc. Final detection uses a substrate that undergoes a 
color change in the presence of the peroxidase. The absence or presence of antibody 
binding may be determined by various methods, including flow cytometry of dissociated 
cells, microscopy, radiography, scintillation counting, etc. 

Alternatively, one may focus on the expression of GST-3. Biochemical studies 
1 5 may be performed to determine whether a sequence polymorphism in a GST-3 coding 
region or control regions is associated with disease. Disease associated polymorphisms 
U may include deletion or truncation of the gene, mutations that alter expression level, that 

5 affect the activity of the protein, etc. 

Changes in the promoter or enhancer sequence that may affect expression levels of 
20 GST-3 can be compared to expression levels of the normal allele by various methods 
known in the art. Methods for determining promoter or enhancer strength include 
quantitation of the expressed natural protein; insertion of the variant control element into a 
vector with a reporter gene such as (3-galactosidase, luciferase, chloramphenicol 
acetyltransferase, etc. that provides for convenient quantitation; and the like. 
25 A number of methods are available for analyzing nucleic acids for the presence of 

a specific sequence, e.g. a disease associated polymorphism. Where large amounts of 
DNA are available, genomic DNA is used directly. Alternatively, the region of interest is 
cloned into a suitable vector and grown in sufficient quantity for analysis. Cells that 
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express GS7/-J may be used as a source of mRNA, which may be assayed directly or 
reverse transcribed into cDNA for analysis. The nucleic acid may be amplified by 
conventional techniques, such as the polymerase chain reaction (PCR), to provide 
sufficient amounts for analysis. The use of the polymerase chain reaction is described in 
5 Saiki, et al. ( 1 985), Science 239:487, and a review of techniques may be found in 
Sambrook, et al. Molecular Clo ning A Laboratory Manual, CSH Press 1989, 
pp.14.2-14.33. Alternatively, various methods are known in the art that utilize 
oligonucleotide ligation as a means of detecting polymorphisms, for examples see Riley et 
al. (1990), Nucl. Acids Res. 18:2887-2890; and Delahunty et al. (1996), Am. J. Hum. 

10 Genet. 58:1239-1246. 

A detectable label may be included in an amplification reaction. Suitable labels 
include fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, 
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 2\7'-dimethoxy-4\5'- 
dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 6-carboxy- 

15 2',4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N\N'- 
tetramemyl-6-carboxyrhoc^mine (TAMRA), radioactive labels, e.g. 32 P, 3S S, 3 H; etc. The 
label may be a two stage system, where the amplified DNA is conjugated to biotin, 
haptens, etc. having a high affinity binding partner, e.g. avidin, specific antibodies, etc., 
where the binding partner is conjugated to a detectable label. The label may be 

20 conjugated to one or both of the primers. Alternatively, the pool of nucleotides used in 
the amplification is labeled, so as to incorporate the label into the amplification product. 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 
number of methods known in the art. The nucleic acid may be sequenced by dideoxy or 
other methods, and the sequence of bases compared to a wild-type GST-3 sequence. 

25 Hybridization with the variant sequence may also be used to determine its presence, by 
Southern blots, dot blots, etc. The hybridization pattern of a control and variant sequence 
to an array of oligonucleotide probes immobilized on a solid support, as described in US 
5,445,934, or in WO 95/35505, may also be used as a means of detecting the presence of 
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variant sequences. Single strand conformational polymorphism (SSCP) analysis, 
denaturing gradient gel electrophoresis (DGGE), and heteroduplex analysis in gel 
matrices are used to detect conformational changes created by DNA sequence variation as 
alterations in electrophoretic mobility. Alternatively, where a polymorphism creates or 
destroys a recognition site for a restriction endonuclease, the sample is digested with that 
endonuclease, and the products size fractionated to determine whether the fragment was 
digested. Fractionation is performed by gel or capillary electrophoresis, particularly 

acrylamide or agarose gels. 

Screening for mutations in GST-3 may be based on the functional or antigenic 
characteristics of the protein. Protein truncation assays are useful in detecting deletions 
that may affect the biological activity of the protein. Various immunoassays designed to 
detect polymorphisms in GST-3 proteins may be used in screening. Where many diverse 
genetic mutations lead to a particular disease phenotype, functional protein assays have 
proven to be effective screening tools. The activity of the encoded GST-3 protein may be 
determined by comparison with the wild-type protein. 

Diagnostic methods of the subject invention in which the level of GST-3 
expression is of interest will typically involve comparison of the GST-3 nucleic acid 
abundance of a sample of interest with that of a control value to determine any relative 
differences, where the difference may be measured qualitatively and/or quantitatively, 
which differences are then related to the presence or absence of an abnormal GST-3 
expression pattern. A variety of different methods for determine the nucleic acid 
abundance in a sample are known to those of skill in the art, where particular methods of 
interest include those described in: Pietu et al., Genome Res. (June 1996) 6: 492-503; 
Zhao et al., Gene (April 24, 1995) 156: 207-213; Soares , Curr. Opin. Biotechnol. 
(October 1997) 8: 542-546; Raval, J. Pharmacol Toxicol Methods (November 1994) 32: 
125-127; Chalifour et al., Anal. Biochem (February 1, 1994) 216: 299-304; Stolz & Tuan, 
Mol. Biotechnol. (December 19960 6: 225-230; Hong et al., Bioscience Reports (1982) 2: 
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907; and McGraw, Anal. Biochem. (1984) 143: 298. Also of interest are the methods 
disclosed in WO 97/273 17, the disclosure of which is herein incorporated by reference. 



Screening Assays 



The subject GST-3 polypeptides (as well as the HEC-specific KSGal6ST 
homologues) find use in various screening assays designed to identify therapeutic agents. 
Thus one can use a cell model such as a host cell, e.g. COS7 cell, which has been 
cotransfected with a selectin ligand cDNA, e.g. GlyCAM- 1 or CD34 and a GST-3 vector. 
One can then label the transfectants with a labeled sulfate, e.g. » S-labeled sulfate, and 
compare the amount of sulfate incorporation into GlyCAM-1 or CD34 in the presence and 
absence of a candidate inhibitor compound. Alternatively, in a cell-free enzyme activity 
assay, recombinant GST-3 polypeptide may be combined with -S-labeled sulfate donor 
such as pS]-PAPS, candidate inhibitor compound, and an acceptor molecule, which may 
be a synthetic carbohydrate mimicking structures found in mature and/or immature L- 
selectin ligands, or a simple nucleophile capable of accepting sulfate (such as phenolic 
compunds, and the like). The amount of [*S]-sulfate transferred to the receptor by the 
candidate agent is then determined by counting the acceptor-associated radioactivity or 
product quantitation with an antibody specific for the sulfated acceptor, or in a suitable 
scintillation proximity assay format. Alternatively, the candidate inhibitor compound may 
also be combined with a selectin, a non-sulfated selectin ligand precursor, GST-3 and a 
sulfate donor compound under physiological conditions and the resultant amount of 
ligand which is capable of binding to the selectin is determined. Depending on the 
particular method, one or more of, usually one of, the specified components may be 
labeled, where by labeled is meant that the components comprise a detectable moiety, e.g. 
a fluorescent or radioactive tag, or a member of a signal producing system, e.g. biotin for 
binding to an enzyme-streptavidin conjugate in which the enzyme is capable of converting 
a substrate to a chromogenic product. 
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A variety of other reagents may be included in the screening assay. These include 
reagents like salts, neutral proteins, e.g. albumin, detergents, etc that are used to facilitate 
optimal protein-protein binding and/or reduce non-specific or background interactions. 
Reagents that improve the efficiency of the assay, such as protease inhibitors, nuclease 
5 inhibitors, anti-microbial agents, etc. may be used. 

The above screening methods may be designed a number of different ways, where 
a variety of assay configurations and protocols may be employed, as are known in the art. 
For example, one of the components may be bound to a solid support, and the remaining 
components contacted with the support bound component. The above components of the 
10 method may be combined at substantially the same time or at different times. Incubations 
are performed at any suitable temperature, typically between 4 and 40 C. Incubation 
periods are selected for optimum activity, but may also be optimized to facilitate rapid 
high-throughput screening. Typically between 0.1 and 1 hours will be sufficient. 
Following the contact and incubation steps, the subject methods will generally, though not 
15 necessarily, further include a washing step to remove unbound components, where such a 
washing step is generally employed when required to remove label that would give rise to 
U a background signal during detection, such as radioactive or fluorescently labeled non- 

S specifically bound components. Following the optional washing step, the presence of 

5 bound selectin-ligand complexes will then be detected. 

S 20 A variety of different candidate agents may be screened by the above methods. 

Candidate agents encompass numerous chemical classes, though typically they are 
organic molecules, preferably small organic compounds having a molecular weight of 
more than 50 and less than about 2,500 daltons. Candidate agents comprise functional 
groups necessary for structural interaction with proteins, particularly hydrogen bonding, 
25 and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably 
at least two of the functional chemical groups. The candidate agents often comprise 
cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic structures 
substituted with one or more of the above functional groups. Candidate agents are also 
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found among biomolecules including peptides, saccharides, fatty acids, steroids, purines, 
pyrimidines, derivatives, structural analogs or combinations thereof. 

Candidate agents are obtained from a wide variety of sources including libraries of 
synthetic or natural compounds. For example, numerous means are available for random 

5 and directed synthesis of a wide variety of organic compounds and biomolecules, 
including expression of randomized oligonucleotides and oligopeptides. Alternatively, 
libraries of natural compounds in the form of bacterial, fungal, plant and animal extracts 
are available or readily produced. Additionally, natural or synthetically produced libraries 
and compounds are readily modified through conventional chemical, physical and 

10 biochemical means, and may be used to produce combinatorial libraries. Known 

pharmacological agents may be subjected to directed or random chemical modifications, 
such as acylation, alkylation, esterification, amidification, etc. to produce structural 
analogs. 

15 GST-3 Nucleic Acid and Polypeptide Therapeutic Compositions 

The nucleic acid compositions of the subject invention also find use as therapeutic 
agents in situations where one wishes to enhance GST-3 activity in a host. The GST-3 
genes, gene fragments, or the encoded GST-3 protein or protein fragments are useful in 

20 gene therapy to treat disorders associated with GST-3 defects. Expression vectors may be 
used to introduce the GST-3 gene into a cell. Such vectors generally have convenient 
restriction sites located near the promoter sequence to provide for the insertion of nucleic 
acid sequences. Transcription cassettes may be prepared comprising a transcription 
initiation region, the target gene or fragment thereof, and a transcriptional termination 

25 region. The transcription cassettes may be introduced into a variety of vectors, e.g. 

plasmid; retrovirus, e.g. lentivirus; adenovirus; and the like, where the vectors are able to 
transiently or stably be maintained in the cells, usually for a period of at least about one 
day, more usually for a period of at least about several days to several weeks. 
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The gene or GST-3 protein may be introduced into tissues or host cells by any 
number of routes, including viral infection, microinjection, or fusion of vesicles. Jet 
injection may also be used for intramuscular administration, as described by Furth et al. 
(1992), Anal Biochem 205:365-368. The DNA may be coated onto gold microparticles, 
and delivered intradermally by a particle bombardment device, or "gene gun" as described 
the literature (see, for example, Tang et al. (1992), Nature 356:152-154), where gold 
projectiles are coated with the DNA, then bombarded into skin cells. 



in 
microi 



Methods of Modulating Selectin Mediated Binding Events 

Also provided are methods of regulating, including modulating and inhibiting, 
selectin mediated binding events. The selectin receptor of the selectin mediated binding 
event will generally be a receptor which binds to a sulfated ligand under physiological 
conditions and is a member of the selectin family of receptors that have an amino terminal 
C-type lectin domain followed by an EFG-like domain, a variable number of short 
consensus repeats known as SCR, CRP or sushi domains, and share greater than 50% 
homology in their lectin and EFG domains. Of interest is the modulation of selectin 
binding events in which the selectin is L-, P-, or E-selectin. Of particular interest are L- 
selecting mediated binding events. 

Where the selectin mediated binding event occurs in vivo in a host, in one 
embodiment an effective amount of active agent that modulates the activity, usually 
reduces the activity, of GST-3 in vivo, is administered to the host. In another embodiment, 
the modulating agent is targeted to KSGal6ST, or an HEC-specific homologue thereof. In 
yet another embodiment, one or more agents are administered that effectively modulate 
both the KSGal6ST or related activity and the GST-3 activity. The active agent may be a 
variety of different compounds, including a naturally occurring or synthetic small 
molecule compound, an antibody, fragment or derivative thereof, an antisense 
composition, and the like. 
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Naturally occurring or synthetic small molecule compounds of interest include 
numerous chemical classes, though typically they are organic molecules, preferably small 
organic compounds having a molecular weight of more than 50 and less than about 2,500 
daltons. Candidate agents comprise functional groups necessary for structural interaction 
5 with proteins, particularly hydrogen bonding, and typically include at least an amine, 
carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional chemical 
groups. The candidate agents often comprise cyclical carbon or heterocyclic structures 
and/or aromatic or polyaromatic structures substituted with one or more of the above 
functional groups. Candidate agents are also found among biomolecules including 
10 peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural 
analogs or combinations thereof. 

Also of interest as active agent are antibodies that at least reduce, if not inhibit, the 
target activity in the host (e.g. the GST-3 activity and/or the KSGal6ST activity). Suitable 
antibodies are obtained by immunizing a host animal with peptides comprising all or a 
5 1 5 portion of the target protein. Suitable host animals include mouse, rat sheep, goat, 
ill hamster, rabbit, etc. The origin of the protein immunogen may be mouse, human, rat, 

L monkey etc. The host animal will generally be a different species than the immunogen, 

p e.g. human GST-3 used to immunize mice, etc. 

§ The immunogen may comprise the complete protein, or fragments and derivatives 

P 20 thereof Preferred immunogens comprise all or a part of GST-3, where these residues 
contain the post-translation modifications, such as glycosylation, found on the native 
target protein. Immunogens comprising the extracellular domain are produced in a variety 
of ways known in the art, e.g. expression of cloned genes using conventional recombinant 

methods, isolation from HEC, etc. 
25 For preparation of polyclonal antibodies, the first step is immunization of the host 

animal with the target protein, where the target protein will preferably be in substantially 
pure form, comprising less than about 1% contaminant. The immunogen may comprise 
the complete target protein, fragments or derivatives thereof. To increase the immune 



U 

H. 1 
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response of the host animal, the target protein may be combined with an adjuvant, where 
suitable adjuvants include alum, dextran, sulfate, large polymeric anions, oil& water 
emulsions, e.g. Freund's adjuvant, Freund's complete adjuvant, and the like. The target 
protein may also be conjugated to synthetic carrier proteins or synthetic antigens. A 

5 variety of hosts may be immunized to produce the polyclonal antibodies. Such hosts 
include rabbits, guinea pigs, rodents, e.g. mice, rats, sheep, goats, and the like. The target 
protein is administered to the host, usually intradermal^, with an initial dosage followed 
by one or more, usually at least two, additional booster dosages. Following immunization, 
the blood from the host will be collected, followed by separation of the serum from the 

10 blood cells. The Ig present in the resultant antiserum may be further fractionated using 
known methods, such as ammonium salt fractionation, DEAE chromatography, and the 
like. 

Monoclonal antibodies are produced by conventional techniques. Generally, the 
spleen and/or lymph nodes of an immunized host animal provide a source of plasma cells. 

1 5 The plasma cells are immortalized by fusion with myeloma cells to produce hybridoma 
cells. Culture supernatant from individual hybridomas is screened using standard 
techniques to identify those producing antibodies with the desired specificity. Suitable 
animals for production of monoclonal antibodies to the human protein include mouse, rat, 
hamster, etc. To raise antibodies against the mouse protein, the animal will generally be a 

20 hamster, guinea pig, rabbit, etc. The antibody may be purified from the hybridoma cell 
supernatants or ascites fluid by conventional techniques, e.g. affinity chromatography 
using GST-3 bound to an insoluble support, protein A sepharose, etc. 

The antibody may be produced as a single chain, instead of the normal multimeric 
structure. Single chain antibodies are described in Jost et al. (1994) JM 269:26267-73, 

25 and others. DNA sequences encoding the variable region of the heavy chain and the 
variable region of the light chain are ligated to a spacer encoding at least about 4 amino 
acids of small neutral amino acids, including glycine and/or serine. The protein encoded 
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by this fusion allows assembly of a functional variable region that retains the specificity 
and affinity of the original antibody. 

For in vivo use, particularly for injection into humans, it is desirable to decrease 
the antigenicity of the antibody. An immune response of a recipient against the blocking 
agent will potentially decrease the period of time that the therapy is effective. Methods of 
humanizing antibodies are known in the art. The humanized antibody may be the product 
of an animal having transgenic human immunoglobulin constant region genes (see for 
example International Patent Applications WO 90/10077 and WO 90/04036). 
Alternatively, the antibody of interest may be engineered by recombinant DNA techniques 
to substitute the CHI, CH2, CH3, hinge domains, and/or the framework domain with the 
corresponding human sequence (see WO 92/02190). 

The use of Ig cDNA for construction of chimeric immunoglobulin genes is known 
in the art (Liu et al. (1987) P.NAS, 84:3439 and (1987) JJmmifflol 139:3521). mRNA 
is isolated from a hybridoma or other cell producing the antibody and used to produce 
cDN A. The cDNA of interest may be amplified by the polymerase chain reaction using 
specific primers (U.S. Patent nos. 4,683,195 and 4,683,202). Alternatively, a library is 
made and screened to isolate the sequence of interest. The DNA sequence encoding the 
variable region of the antibody is then fused to human constant region sequences. The 
sequences of human constant regions genes may be found in Kabat et al. (1991) 

Sequenegs olfjoMas ^tt™"'™"»' T " terest - Nm publication n0 ' 91 ' 3242 ' Human 

C region genes are readily available from known clones. The choice of isotype will be 
guided by the desired effector functions, such as complement fixation, or activity in 
antibody-dependent cellular cytotoxicity. Preferred isotypes are IgGl, IgG3 and IgG4. 
Either of the human light chain constant regions, kappa or lambda, may be used. The 
chimeric, humanized antibody is then expressed by conventional methods. 

Antibody fragments, such as Fv, F(ab') 2 and Fab may be prepared by cleavage of 
the intact protein, e.g. by protease or chemical cleavage. Alternatively, a truncated gene is 
designed. For example, a chimeric gene encoding a portion of the F(ab') 2 fragment would 
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include DNA sequences encoding the CHI domain and hinge region of the H chain, 
followed by a translation^ stop codon to yield the truncated molecule. 

Consensus sequences of H and L J regions may be used to design oligonucleotides 
for use as primers to introduce useful restriction sites into the J region for subsequent 
linkage of V region segments to human C region segments. C region cDNA can be 
modified by site directed mutagenesis to place a restriction site at the analogous position 

in the human sequence. 

Expression vectors include plasmids, retroviruses, YACs, EBV derived episomes, 
and the like. A convenient vector is one that encodes a functionally complete human CH 
or CL immunoglobulin sequence, with appropriate restriction sites engineered so that any 
VH or VL sequence can be easily inserted and expressed. In such vectors, splicing 
usually occurs between the splice donor site in the inserted J region and the splice 
acceptor site preceding the human C region, and also at the splice regions that occur 
within the human CHexons. Polyadenylation and transcription termination occur at 
native chromosomal sites downstream of the coding regions. The resulting chimeric 
antibody may be joined to any strong promoter, including retroviral LTRs, e.g. SV-40 
early promoter, (Okayama et al (1983) Mol. Cell. Bio. 3:280), Rous sarcoma virus LTR 
(Gorman et al (1982) P.N.A.S. 79:6777), and moloney murine leukemia virus LTR 
(Grosschedl et al (1985) Cell 41:885); native Ig promoters, etc. 

In yet other embodiments of the invention, the active agent is an agent that 
modulates, and generally decreases or down regulates, the expression of the gene 
encoding the target protein in the host. For example, antisense molecules can be used to 
down-regulate expression of GST-3 in cells. The anti-sense reagent may be antisense 
oligonucleotides (ODN), particularly synthetic ODN having chemical modifications from 
native nucleic acids, or nucleic acid constructs that express such anti-sense molecules as 
RNA. The antisense sequence is complementary to the mRNA of the targeted gene, and 
inhibits expression of the targeted gene products. Antisense molecules inhibit gene 
expression through various mechanisms, e.g. by reducing the amount of mRNA available 
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for translation, through activation of RNAse H, or steric hindrance. One or a combination 
of antisense molecules may be administered, where a combination may comprise multiple 
different sequences. 

Antisense molecules may be produced by expression of all or a part of the target 
gene sequence in an appropriate vector, where the transcriptional initiation is oriented 
such that an antisense strand is produced as an RNA molecule. Alternatively, the 
antisense molecule is a synthetic oligonucleotide. Antisense oligonucleotides will 
generally be at least about 7, usually at least about 12, more usually at least about 20 
nucleotides in length, and not more than about 500, usually not more than about 50, more 
usually not more than about 35 nucleotides in length, where the length is governed by 
efficiency of inhibition, specificity, including absence of cross-reactivity, and the like. It 
has been found that short oligonucleotides, of from 7 to 8 bases in length, can be strong 
and selective inhibitors of gene expression (see Wagner et al (1996), Nature Biotechnol. 
14:840-844). 

A specific region or regions of the endogenous sense strand mRNA sequence is 
chosen to be complemented by the antisense sequence. Selection of a specific sequence 
for the oligonucleotide may use an empirical method, where several candidate sequences 
are assayed for inhibition of expression of the target gene in an in vitro or animal model. 
A combination of sequences may also be used, where several regions of the mRNA 
sequence are selected for antisense complementation. 

Antisense oligonucleotides may be chemically synthesized by methods known in 
the art (see Wagner et al (1993), supra, and Milligan et al, supra) Preferred 
oligonucleotides are chemically modified from the native phosphodiester structure, in 
order to increase their intracellular stability and binding affinity. A number of such 
modifications have been described in the literature, which alter the chemistry of the 
backbone, sugars or heterocyclic bases. 

Among useful changes in the backbone chemistry are phosphorothioates; 
phosphorodithioates, where both of the non-bridging oxygens are substituted with sulfur; 
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phosphoroamidites; alkyl phosphotriesters and boranophosphates. Achiral phosphate 
derivatives include 3'-0'-5'-S-phosphorothioate, 3'-S-5'-0-phosphorothioate, 3'-CH2-5'- 
O-phosphonate and 3^NH-5'-0-phosphoroamidate. Peptide nucleic acids replace the 
entire ribose phosphodiester backbone with a peptide linkage. Sugar modifications are 
also used to enhance stability and affinity. The a-anomer of deoxyribose may be used, 
where the base is inverted with respect to the natural p-anomer. The 2'-OH of the ribose 
sugar may be altered to form 2'-0-methyl or 2'-0-allyl sugars, which provides resistance 
to degradation without comprising affinity. Modification of the heterocyclic bases must 
maintain proper base pairing. Some useful substitutions include deoxyuridine for 
deoxythymidine; 5-methyl-2'-deoxycytidine and 5-bromo-2'-deoxycytidine for 
deoxycytidine. 5- propynyl-2'-deoxyuridine and 5-propynyl-2'-deoxycytidine have been 
shown to increase affinity and biological activity when substituted for deoxythymidine 
and deoxycytidine, respectively. 

As an alternative to anti-sense inhibitors, catalytic nucleic acid compounds, e.g. 
ribozymes, anti-sense conjugates, etc. may be used to inhibit gene expression. Ribozymes 
may be synthesized in vitro and administered to the patient, or may be encoded on an 
expression vector, from which the ribozyme is synthesized in the targeted cell (for 
example, see International patent application WO 9523225, and Beigelman et al. (1995), 
Nucl. Acids Res. 23:4434-42). Examples of oligonucleotides with catalytic activity are 
described in WO 9506764. Conjugates of anti-sense ODN with a metal complex, e.g. 
terpyridylCu(II), capable of mediating mRNA hydrolysis are described in Bashkin et al. 
(1995), Appl. Biochem. BiotechnoL 54:43-56. 

As mentioned above, an effective amount of the active agent is administered to the 
host, where "effective amount" means a dosage sufficient to produce a desired result. 
Generally, the desired result is at least a reduction in the amount of selectin binding as 

compared to a control. 

In the subject methods, the active agent(s) may be administered to the host using 
any convenient means capable of resulting in the desired inhibition of selectin binding. 
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Thus, the agent can be incorporated into a variety of formulations for therapeutic 
administration. More particularly, the agents of the present invention can be formulated 
into pharmaceutical compositions by combination with appropriate, pharmaceutically 
acceptable carriers or diluents, and may be formulated into preparations in solid, 
5 semi-solid, liquid or gaseous forms, such as tablets, capsules, powders, granules, 
ointments, solutions, suppositories, injections, inhalants and aerosols. 

As such, administration of the agents can be achieved in various ways, including 
oral, buccal, rectal, parenteral, intraperitoneal, intradermal, transdermal, intracheal,etc, 
administration. 

10 In pharmaceutical dosage forms, the agents may be administered in the form of 

their pharmaceutically acceptable salts, or they may also be used alone or in appropriate 
association, as well as in combination, with other pharmaceutically active compounds. 
The following methods and excipients are merely exemplary and are in no way limiting. 
For oral preparations, the agents can be used alone or in combination with 
15 appropriate additives to make tablets, powders, granules or capsules, for example, with 
conventional additives, such as lactose, mannitol, corn starch or potato starch; with 
binders, such as crystalline cellulose, cellulose derivatives, acacia, com starch or gelatins; 
with disintegrators, such as corn starch, potato starch or sodium carboxymethylcellulose; 
with lubricants, such as talc or magnesium stearate; and if desired, with diluents, buffering 
20 agents, moistening agents, preservatives and flavoring agents. 

The agents can be formulated into preparations for injection by dissolving, 
suspending or emulsifying them in an aqueous or nonaqueous solvent, such as vegetable 
or other similar oils, synthetic aliphatic acid glycerides, esters of higher aliphatic acids or 
propylene glycol; and if desired, with conventional additives such as solubilizers, isotonic 
25 agents, suspending agents, emulsifying agents, stabilizers and preservatives. 

The agents can be utilized in aerosol formulation to be administered via inhalation. 
The compounds of the present invention can be formulated into pressurized acceptable 
propellants such as dichlorodifluoromethane, propane, nitrogen and the like. 
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Furthermore, the agents can be made into suppositories by mixing with a variety 
of bases such as emulsifying bases or water-soluble bases. The compounds of the present 
invention can be administered rectally via a suppository. The suppository can include 
vehicles such as cocoa butter, carbowaxes and polyethylene glycols, which melt at body 
temperature, yet are solidified at room temperature. 

Unit dosage forms for oral or rectal administration such as syrups, elixirs, and 
suspensions may be provided wherein each dosage unit, for example, teaspoonful, 
tablespoonful, tablet or suppository, contains a predetermined amount of the composition 
containing one or more inhibitors. Similarly, unit dosage forms for injection or 
intravenous administration may comprise the inhibitors) in a composition as a solution in 
sterile water, normal saline or another pharmaceutically acceptable carrier. 

The term "unit dosage form," as used herein, refers to physically discrete units 
suitable as unitary dosages for human and animal subjects, each unit containing a 
predetermined quantity of compounds of the present invention calculated in an amount 
sufficient to produce the desired effect in association with a pharmaceutically acceptable 
diluent, carrier or vehicle. The specifications for the novel unit dosage forms of the 
present invention depend on the particular compound employed and the effect to be 
achieved, and the pharmacodynamics associated with each compound in the host. 

The pharmaceutically acceptable excipients, such as vehicles, adjuvants, carriers 
or diluents, are readily available to the public. Moreover, pharmaceutically acceptable 
auxiliary substances, such as pH adjusting and buffering agents, tonicity adjusting agents, 
stabilizers, wetting agents and the like, are readily available to the public. 

Where the agent is a polypeptide, polynucleotide, analog or mimetic thereof, e.g. 
antisense composition, it may be introduced into tissues or host cells by any number of 
routes, including viral infection, microinjection, or fusion of vesicles. Jet injection may 
also be used for intramuscular administration, as described by Furth et al. (1992), Anal 
Biochem 205:365-368. The DNA may be coated onto gold microparticles, and delivered 
intradermally by a particle bombardment device, or "gene gun" as described in the 
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literature (see, for example, Tang et al (1992), Nature 356:1 52-154), where gold 
microprojectiles are coated with the therapeutic DNA, then bombarded into skin cells. 

Those of skill will readily appreciate that dose levels can vary as a function of the 
specific compound, the severity of the symptoms and the susceptibility of the subject to 
5 side effects. Preferred dosages for a given compound are readily determinable by those of 
skill in the art by a variety of means. 

The subject methods find use in the treatment of a variety of different disease 
conditions involving selectin binding interactions, particularly L-, E- or P- selectin, and 
more particularly L-selectin mediated binding events. Such disease conditions include 
1 0 those disease conditions associated with or resulting from the homing of leukocytes to 
sites of inflammation, the normal homing of lymphocytes to secondary lymph organs; and 

l~ the like. Accordingly, specific disease conditions that may be treated with the subject 

O 

5 methods include: acute or chronic inflammation; autoimmune and related disorders, e.g. 

!j systemic lupus erythematosus, rheumatoid arthritis, polyarteritis nodosa, polymyositis and 

m 15 dermatomyositis, progressive systemic sclerosis (diffuse scleroderma), 

IP 

IV glomerulonephritis, myasthenia gravis, Sjogren's syndrome, Hashimoto's disease and 

U Graves' disease, adrenalitis, hypoparathyroidism, and associated diseases; pernicious 

q anemia; diabetes; multiple sclerosis and related demyelinating diseases; uveitis 

| pemphigus and pemphigoid; cirrhosis and other diseases of the liver; ulcerative colitis; 

£ 20 myocarditis; regional enteritis; adult respiratory distress syndrome; local manifestations of 
drug reactions (dermatitis, etc.); inflammation-associated or allergic reaction patterns of 
the skin; atopic dermatitis and infantile eczema; contact dermatitis, psoriasis lichen 
planus; allergic enteropathies; atopic diseases, e.g. allergic rhinitis and bronchial asthma; 
transplant rejection (heart, kidney, lung, liver, pancreatic islet cell, others); 
25 hypersensitivity or destructive responses to infectious agents; poststreptococcal diseases 
e.g. cardiac manifestations of rheumatic fever, etc.; tissue rejection during transplantation; 
and the like. 
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By treatment is meant at least an amelioration of the symptoms associated with the 
pathological condition afflicting the host, where amelioration is used in a broad sense to 
refer to at least a reduction in the magnitude of a parameter, e.g. symptom, associated with 
the pathological condition being treated, such as inflammation and pain associated 
therewith. As such, treatment also includes situations where the pathological condition, or 
at least symptoms associated therewith, are completely inhibited, e.g. prevented from 
happening, or stopped, e.g. terminated, such that the host no longer suffers from the 
pathological condition, or at least the symptoms that characterize the pathological 
condition. 

A variety of hosts are treatable according to the subject methods. Generally such 
hosts are "mammals" or "mammalian," where these terms are used broadly to describe 
organisms which are within the class mammalia, including the orders carnivore (e.g., dogs 
and cats), rodentia (e.g., mice, guinea pigs, and rats), and primates (e.g., humans, 
chimpanzees, and monkeys). In many embodiments, the hosts will be humans. 

Kits with unit doses of the active agent, usually in oral or injectable doses, are 
provided. In such kits, in addition to the containers containing the unit doses will be an 
informational package insert describing the use and attendant benefits of the drugs in 
treating pathological condition of interest. Preferred compounds and unit doses are those 
described herein above. 



The following examples are offered primarily for purposes of illustration. It will 
be readily apparent to those skilled in the art that the formulations, dosages, methods of 
administration, and other parameters of this invention may be further modified or 
substituted in various ways without departing from the spirit and scope of the invention. 
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FyPRRTMENTAL 



IDENTIFICATION OF GST-1 & GST-2 



Human ESTs that are related to the C6ST/KSST at the protein level were searched 
by using TBLASTN which compares a protein query sequence against a nucleotide 
sequence database translated in all 6 reading frames. See Karlin, Samuel and Stephen F. 
Altschul (1990), Methods for assessing the statistical significance of molecular sequence 
features by using general scoring schemes, Proc. Natl. Acad. Sci. USA 87:2264-68; and 
Karlin, Samuel and Stephen F. Altschul (1993), Applications and statistics for multiple 
high-scoring segments in molecular sequences, Proc. Natl. Acad. Sci. USA 90:5873-7. 
As shown in Table 1 , several ESTs, ranging from 228 to 861 bases, resulted in high 
scores. When compared over their entire length with the C6ST/KSST, the predicted 
amino acid identities ranged from 27% to 57%. 

Table 1. Human ESTs related to the chick chondroitin 6/keratan sulfate sulfotransferase 



No. 



mRNA source 



1 infant brain 

2 infant brain 

3 adult heart 

4 fetal lung 

5 fetal liver/spleen 

6 teratocarcinoma 



Covering AA of 
C6ST/KSST 

347-451 
140-216 
405-451 
89-375 
332-403 
100-165 



Identity of AA 
Sequences (%) 



Contig 
Assignmen 
t 



42 
57 
42 
30 
27 
31 



1 
1 
1 
2 
2 
2 



The cDNA clones corresponding to each EST were obtained from the ATCC and 
Research Genetics, Inc.,(Huntsville, AL) and sequenced in full to obtain further 3' 
information. Sequence alignment analysis revealed the presence of two distinct sequences 
("contigs"), covering 74% (contig 1, starting at amino acid 137) and 78% (contig 2, 
starting at amino acid 100). Contig sequences 1 and 2 are apparently both complete at the 
3- end, since both contain poly A tracts at the end of their 3' untranslated regions (UTR). 
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Expression of transcripts corresponding to the two contigs was examined in a number of 
human tissues by Northern analysis. Blots of poly A + RNA (Clontech, Palo Alto, 
California) were probed at high stringency with probes derived from the EST clones. A 
3.1 kb band corresponding to contig 1 was detected in multiple human organs (heart, 
placenta, lung, liver, skeletal muscle, kidney, pancreas, spleen, lymph nodes, thymus) but 
most strongly in brain. Contig 2 was also broadly expressed in various organs (3.3 kb 
band), including lymph node and brain. 

Full-length cDNAs containing the two contigs and predicting CS6T/KSST homologs 
were obtained by screening a human fetal brain XZAP cDNA library (Stratagene, La Jolla, 
California) with labeled 700-800 bp restriction fragments (from EST 2 for contig 1 and 
from EST 5 for contig 2). Briefly, 10* plaque-forming units were used to infect E. colU 
which were then distributed on 20 plates. Duplicate filter lifts were performed. The 
probes were labeled with 32 P by random priming (Amersham), and hybridization was 
performed at 60°C with high stringency washing. In both screens, multiple positive spots 
were obtained in the first round. Single positive clones were obtained after either the 
second or third round of screening. Multiple clones were sequenced for each gene and the 
presence of the ESTs was confirmed. As will be described below, the cDNAs contain 
open reading frames that encode predicted proteins of high homology to CS6T/KSST. 
The proteins encoded by these cDNAs were designated as GST 1 and GST 2, where 
"GST" denotes "glycosylsulfotransferase." GST 1 has been independently cloned and 
assigned the name «KSGal6ST) by Fukuta et al., J. Biol. Chem. (1997) 272: 32321-8. 

II. Identification of GST-3 

ESTs potentially coding for novel human glycosyl sulfotransferases other than GST-1 & 2 
were identified through a secondary homology screen, in which the peptide sequences of 
GST-1 and GST-2 were used as template in two parallel TBLASTN searches against a 
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public (dbest) and a private genomic database (Lifeseq, Incyte Pharmaceuticals, Palo Alto, 
CA). Only matches that produced alignments with smallest sum probabilities P(N) < 10" 5 
were selected from the output of the search, imported into a contig assembler (Sequencher 
3.0, Gene Codes Corporation, Ann Arbor, Michigan) and assembled using the default 
settings of the program. The vast majority of these matches assembled into two contigs 
defined by GST-1 and GST-2. However, four particular ESTs found only in the private 
Lifeseq database did not assemble into either contig or with each other. These were 
termed GST-3 through GST-6. 

III. GST-3 Is Expressed in High Endothelial Venules 



In order to investigate if any of the above putative human glycosyl sulfotransferases or 
similar genes were expressed in high endothelial venules, an HEV-derived cDNA pool for 
use as template in homology polymerase chain reaction (PCR) was prepared. In order to 

15 clone HEV genes, an expression library from the aforementioned HEV-derived cDNA 
was also generated. Briefly, total RNA (45 M g) was isolated from 10 7 HEC. Since the 
amount of poly A + RNA was too limited for preparation of a cDNA library by 
conventional procedures, the Capfinder (SMART™) cDNA technology (CLONTECH) 
was used. In this technique, the reverse transcription reaction is primed by a modified 

20 oligo(dT) primer (containing a Not I site) and a "SMART" oligonucleotide which anneals 
to an oligo dC stretch added by reverse transcriptase (RT) at the 3' end of the first strand 
cDNA. The annealed oligonucleotide serves as a "switch" template for RT, resulting in 
the generation of single stranded cDNAs which are enriched for full length sequences and 
contain universal primer sites for subsequent long distance PCR amplification. This 

25 technology therefore makes it possible to generate high quality double stranded cDNA 
(from limiting amounts of RNA), which is sufficient to construct a library. According to 
the published test results for this technology, Capfinder cDNA is comparable to 
conventionally prepared cDNA in gene representation and is significantly enriched for full 
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length cDNAs. The HEC cDNA generated by the Capfinder technology was evaluated by 
PCR for the presence of the following genes, which are known or suspected to be 
expressed in HEC: CD34 (Baumhueter et al., Science (1993) 262: 436-438), hevin (Girard 
& Springer, Immunity (1995) 2:1 13-123), fucosyltransferase VII (Maly et al., Cell (1996) 
5 86: 643-653); p-l,6-N-acetylglucosaminyl-transferase (C2GnT) (Bierhuizen & Fukuda., 
Proc. Nad. Acad. Sci. USA (1992) 89: 9326-9330), and fractalkine (Schall, Immunology 
Today (1997) 18:147). By this analysis, all of these cDNAs were detected in the HEC 
cDNA, and at least two of them (CD34 and C2GnT) were full length. With this validation 
of the HEC cDNA, a library was generated as follows: the double-stranded cDNA was 
10 ligated to Eco RI adapters, digested with Not I and cloned into the Not I and Eco RI sites 
of pCDNAl.l (Invitrogen, Inc, Carlsbad, CA), which is a modified version of the 
!= eucaryotic expression vector pCDM8 (Aruffo et al., Proc. Natl. Acad. Sci. USA (1987) 

I 84: 8753-8577). The resulting libary has a complexity of 500,000 independent clones and 

an average insert size of 1 . 1 kb, according to the characterization performed by 
S 15 CLONTECH. 



HEV-derived Capfinder cDNA was used as a template for homology PCR with 
degenerate primers. In-frame translations of GST-1 and GST-2 were aligned with other 
1 known sulfotransferase protein sequences retrieved from the public databases. See Fig. 3. 

20 Three putative consensus regions were identified, and the following degenerate primers 
were synthesized to encode within these consensus regions a maximal number of possible 
permutations at the amino-acid level in order to cover a maximal number of novel 
sulfotransferases that may fall into these patterns. 
These primers were (I = inosine): 
25 A+: 5' TWYTWY CTI TWYGARCC I CT I TGGCAYST 
B+: 5' CTIAAICTISTICWRCTISTIMGIRAYCC 
B-: 5' GGRTYICKIASIAGYWGIASIAGITTIAG 
C-: 5' AGRTCYTCRTAICKIAGIAGIAKRTA 
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(SEQ 


ID 


NO: 


03) 


3' 


(SEQ 


ID 


NO: 


04) 


3' 


(SEQ 


ID 


NO: 


05) 


3' 


(SEQ 


ID 


NO: 


06) 



In the first round PCR each reaction contained in a total volume of 50 (A 100 mM Tris-Cl 
(pH 8.3), 0.5 M KC1, 15 mM MgCl 2 , forward and reverse primer (0.5 each), dATP, 
dCTP, dGTP, and dTTP (100 fM each), 0.25 units Thermus aquaticus DNA polymerase 
5 (Boehringer Mannheim #1647679), and 0.5 (A of HEV-message derived Cap-finder 
cDNA preparation (generated by Clontech Inc., cf. above). In "no template" control 
samples the cDNA was omitted. 

Each reaction was cycled as follows: hold 4 min @ 94°C, then 35 cycles of [30 sec @ 
10 94°C followed by 30 sec @ 40°C followed by 1 min @ 72°C], then hold 6 min @ 72°C. 
Following completion of PCR a 20 fA aliqout of each reaction was analysed by standard 
M- horizontal agarose (1 %) gel electrophoresis. No discernable band pattern was observed 

O 

P (data not shown) 

6 

5j 15 Therefore the unfractionated products of the first round PCR were used as template in a 
W second round PCR. Here each reaction contained in a total volume of 50 fA 1 00 mM Tris- 

l* CI (pH 8.3), 0.5 M fcCl, 1 5 mM MgCl 2 , forward and reverse primer (0.5 fM each), dATP, 

dCTP, dGTP, and dTTP (100 fM each), 0.25 units Thermus aquaticus DNA polymerase 
| (Boehringer Mannheim #1647679), and 1 (A of total PCR reaction from round 1 (cf 

20 above). 

Each reaction was cycled as follows: hold 4 min @ 94°C, then 35 cycles of [30 sec @ 
94°C followed by 30 sec @ 45 °C followed by 1 min @ 72°C], then hold 6 min @ 72°C. 
The entire reactions were then fractionated by standard horizontal agarose (1 %) gel 
25 electrophoresis. Three bands appearing at positions 2.1, 2.2 and 2.3, were excised and 
DNA eluted from the gel using the QIAquick PCR purification kit (Qiagen Inc. #28104). 
Eluted DNA was then subcloned into the TA cloning vector pCR-II (stratagene) and E- 
coli transformed with recombinant plasmids. For each band eight colonies were 
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expanded, and plasmid DNAs isolated and sequenced using standard dideoxynucleotide 
chain termination methodology with fluorimetric detection. 

In order to map the amplicons generated by the above homology PCR, public (dbest) and 
5 private (Incyte IncO EST databases were screened with by the TBLASTX algorithm 
(Karlin & Altschul, 1990 & 1993; cf. above) using the sequences of these amplicons as 
query sequences. Four sequences amplified from from HEV-cDNA with primers B+ and 
C- aligned with >95 % overall identity to Incyte EST #2620445 defined perviously as 
GST-3 (cf. above). All other query sequences did not pick up statistically significant 
1 0 matches in the specified databases. 

IV. GST-3 Is Expressed in HEC 

From the extended DNA sequence of Lifeseq clone #2620445 = GST-3 we designed a 
nondegenerate primer pair located within the incomplete open reading frame encoded by 
15 this EST. 

Forward: 5 'AAACTCAAGAAGGAGGACCAACCCTACTATGTGATGC 3" (SEQ ID 
NO: 07) 

20 Reverse: 5 1 ATAAAGCTTGTGGATTTGTTCAGGGACATTCCAGGTAGACAGAAGAT 
3 • (SEQ ID NO: 08) 

Using RT-PCR, a PCR product of appropriate length (500 bp) was amplified from HEC 
cDNA with this primer pair. This product could not be amplified from cDNAs prepared 
25 from tonsillar lymphocytes or primary cultured human umbilical vein endothelial cells 
(HUVEC). Control primers for hypoxanthine phosphoribosyl transferase (HPRT, a 
ubiquitously expressed cellular "housekeeping enzyme") were used in parallel to establish 
that similar amounts of template were used in each set of PCR reactions and that none of 
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the template DN As were substantially degraded. These RT-PCR results confirm that the 
gene corresponding to the PCR product is expressed in HEC but not in lymphocytes or 
HUVEC. Northern analysis has failed to detect mRNA for the new gene in a variety of 
human tissues and organs (but did detect a signal in liver, pancrease, lymph node and 
5 HEC, establishing that the expression of this gene is highly restricted. Also, by in situ 
hybridization, transcripts for mouse GST-3 were selectively found in lymph node HEV. 

V. GST-3 Cloning 

A full length cDNA from the HEC library described in the previous section was cloned as 
10 follows. The pool selection procedure described in Bakker et al., J. Biol. Chem. (1997) 
272:29942-6) was used to quickly isolate the cDNA. It was first established that the 
relevant template was contained within the library by successfully amplifying the above 
described PCR product from the library stock comprising the entire library. An aliquot of 
this bacterial stock was then divided into 200 pools of 2000-3000 colonies each. Each 
1 5 pool was plated out on LB plates and the colonies were allowed to grow to a healthy size. 
The colonies were harvested in LB and allowed to grow further at 37°C, at which time 
glycerol stocks were prepared from each pool. By PCR analysis of the pools, nine 
positives were identified in this first round of screening. The corresponding bacterial 
stock for one of these pools was then titered and plated at 100 colony forming units (cfu) 
20 per plate in 40 plates. Plates were grown, harvested, preserved and analyzed as in the first 
round, resulting in the identification of three positive subpools. At this stage, one of the 
three positive pools was plated at a density (300 cfu) so that individual colonies could be 
analyzed by PCR. One cDNA clone was obtained by this approach. It contains a 
complete open reading frame which encodes a novel 386 amino acid protein, termed GST- 
25 3. This full length cDNA sequence was then used as template in a BLASTN search of the 
public (dbest) and Lifeseq EST databases. In this manner, two so far unrecognized ESTs 
#2617407 (from Lifeseq; derived from a human gall bladder cDNA library) and g2262929 
(from the mouse EST collection included in the dbest database, derived from a murine 
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mammary gland cDNA library) were identified. The former EST included the 5' end of 
GST-3 open reading frame. Since this EST was generated with an oligo dT-primer, it 
therefore contains the entire open.reading frame plus all 3' untranslated sequence of the 
human GST-3 cDNA. This EST was retrieved from Incyte in the form of a plasmid- 
transformed E.coli culture, expanded into Luria Bertoni Medium (with 0.1 mg/ml 
Ampicillin). The plasmid was isolated from a 500 ml culture and sequenced using 
standard dideoxynucleotide chain termination methodology with fluorimetric detection. 
Since no PCR-step was used in generating the full length GST-3 Lifeseq EST Incyte 
#2167407 ( in contrast to the Cap-finder methodology employed in generation of our 
HEV-library), the GST-3 sequence obtained from Incyte #2617407 is free of PCR errors. 
The sequence is provided in SEQ ID NO:02 and shown in Fig. 1 . 

VI. Characterization of GST-3 

A. Three cDNA clones which encode three different human homologs for 
C6ST/KSST have been obtained. The predicted GST proteins are type 2 membrane 
proteins 41 1, 484, and 386 amino acids in length, respectively. Each has a relatively short 
transmembrane domain and a short amino-terminal cytoplasmic tail. Table 2 demontrates 
the high homologies among the 3 human proteins and the chick CS6T/KSST. Overall 
homologies at the amino acid level ranged from 28% to 40% identity. Strikingly, there 
are three regions of 16 to 29 amino acids in which identity among the three GSTs ranged 
from 50-59% and similarity ranged from 65-94%. See Fig. 4. Fig. 4 shows that all four of 
the sulfotransferases are type II transmembrane proteins with short cytoplasmic tails 
(TM). There are three regions (region A, B and C) in which identities among the human 
GSTs range from 50-59% and similarities range from 65 to 94%. The amino acid 
sequence for the regions are: 
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(T/S)XRSGSSF(V/F)G(Q/E)LFXQX(P/L)(D/E)VF(F/Y)L(F/Y/M)EP(IJV/AXW/Y)HV 

L(N/D)L(Km)(V/IXW)XLVRDPR(A/G)(V/IXLAF) 

PXXL(Q/K)XXY(L/M)(L/V)VRYEDL(A/V)XXP 

Table 2: Percent amino acid identities for the predicted coding sequences 





GST 1 


GST 2 


GST 3 


CS6T/KSST 


GST 1 
GST 2 
GST 3 




31 


32 
35 


40 
28 
31 



10 



B. GST-1 is the same as the sulfotransferase reported by Fukuta et al. supra (1997) 
and named KSGal6ST. GST-3 ( HEC-GlcNAc6ST), is a novel GlcNAc-6- 
sulfotransferase. These two sulfotransferases, together with the chicken C6/KSST 
(Fukuta et al., 1 995, supra) and the recently reported human chondroitin-6- 

15 sulfotransferase (C6ST, specificity for C-6 of GalNAc, Fukuta, et al., Biophys. Acta. 
(1998)1399:57-61) and GlcNAc-6-sulfotransferase (GlcNAc6ST) (Uchimura et al., J. 
Biochem (Tokyo) (1998b) 124:670-678) (which corresponds to GST-2) constitute a family 
of highly conserved enzymes. Overall amino acid identities within the family range from 
27 to 42%. These enzymes are type II transmembrane proteins with short cytoplasmic 

20 tails, features which are typical of glycosyltransferases and carbohydrate sulfotransferases, 
with the exception of the heparan sulfate D-glucosamino-3-O-sulfotransferase (Shworak 
et al., 1997). 

Within this new family of carbohydrate sulfotransferases, there are three regions of 
25 amino acid sequence in which amino acid identity ranges from 45-54% and similarity 

from 80-90% (Fig. 5). Regions one and two contain elements that conform to the recently 
described consensus binding motifs for the high energy phosphate donor, 3'- 
phosphoadenosine-5'-phosphosulfate. These elements are found in all sulfotransferases 
characterized to date (Kakuta et al., Trends Biochem. Sci. (1998)23:129-130. In addition, 
30 regions one and three contain two stretches of sequence of 1 1 amino acids each 
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(corresponding to amino acids 124-134 and 328-339, respectively, in the HEC- 
GlcNAc6ST sequence) that are highly conserved (>90% similarity). This suggests that 
these two elements contribute to a binding pocket that interacts with the 6-hydroxyl group 
of an appropriate oligosaccharide acceptor (Gal, GalNAc, or GlcNAc) to bring it into 
apposition with the donor phosphosulfate group. 

VII. Characterization of GST-3 (HEC-GlcNAc6STt) and KSGal6ST 



A. GST-3 Sulfates GlyCAM-1 

In expression experiments, the sulfotransferase activity of the GST 3 protein by 
transient expression of its cDNA into COS cells has been investigated. Since the HEC 
library yielding the GST 3 cDNA was in the pcDNAl.l expression vector, there was no 
need to subclone the GST 3 insert prior to transfection. Co-transfection of the GST 3 
cDNA with a cDNA encoding a GlyCAM-1 /human IgGl Fc chimera resulted in a >10 
fold enhanced incorporation of 35 S-S0 4 relative to transfection with the GlyCAM-1 
chimera alone. Co-transfection with vector cDNA had no effect. BySDS-PAGE 
analysis, incorporation of 35 S-S0 4 counts into the GlyCAM-1 chimera was confirmed. The 
results are shown in Fig. 6. The results indicate that GST 3 encodes a sulfotransferase 
that can utilize GlyCAM-1 as an acceptor. 

B. Specificity of GST-3 (HEC-GlcNAc6ST) Defined with Synthetic Acceptors 

To define the activity of the protein encoded by the novel cDNA described above, 
the novel cDNA was expressed in COS cells and cellular extracts were tested for their 
ability to transfer 35 S-sulfate from 35 S-[3' phosphoadenosine 5' phosphosulfate] (PAPS) to 
synthetic oligosaccharide acceptors using the assay described in Bowman et al., Chem. 
and Biology (1998) 5:447-460. The disaccharide and trisaccharide acceptors (Table 3) 
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were based on core structures of GlyCAM-1 chains (Fig. 7) with the substitution of Gal 
for GalNAc at the reducing termini. 
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Table 3. Nomenclature and Structure of Oligosaccharides and Substituents 




Name 

sLe x (sialyl Lewis x) 
6'-sulfo sLe* 
6-sulfo sLe" 
6',6-disulfo sLe x 
core 2 

disaccharide acceptor 
trisaccharide acceptor 
R 



Structure 

Siaa2-3Gaipi-4[Fucal-3]GlcNAc 
Siaa2~3 [S0 3 -6]Galp 1 -4[Fucal -3]GlcN Ac 
Siaa2-3Galpl-4[Fucal-3][S0 3 -6]GlcNAc 
Siaa2-3[SOj-6]Galpl-4[Fucal-3][S0 3 -6]GIcNAc 
Galp 1 -3 [GlcNAcp 1 -6]GalNAc 
GlcNAcpl-6Galol-R 
Galp 1 -4G lcNAcp 1 -6Galc 1 -R 
C-CH^CH ^CONH.KCHJt-CH, 



Substantial radioactivity was transferred to the disaccharide acceptor 
GlcNAcp l-6Gala-R, but the activity towards the trisaccharide acceptor 
Galpl-4GlcNAcpl-6Gala-R was barely above the control generated by an extract from 
| mock-transfected cells (Fig. 8). The position and regiochemistry of sulfation was 

W 20 established by HPAEC analysis using a previously described protocol (Bowman et al., 
|- 1998, supra). Label was associated exclusively with GlcNAc-6-sulfate (Fig. 9). This 

n analysis establishes that this cDNA encodes a GlcNAc-6-sulfbtransferase, providing the 

basis for the designation HEC-GlcNAc6ST. Furthermore, the enzyme requires a terminal 
GlcNAc residue for recognition, mirroring the profile of activity observed previously in 
25 extracts of isolated HEC (Bowman et al., 1 998, supra). 



C. Sulfation of GlyCAM-1 by KSGal6ST (GST-1) and HEC-GlcNAc6ST (GST-3) 



To test whether KSGal6ST and HEC-GlcNAc6ST were capable of sulfating a 
30 bonafide L-selectin ligand in cells, COS cells were transfected with a cDNA encoding a 
GlyCAM-l/IgG chimera and a cDNA encoding one or the other sulfotransferase. The 
transfected cells were cultured in the presence of 35 S-sulfate and radiolabeled GlyCAM- 
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1/Ig was purified from the conditioned medium on protein A-agarose. It was found that 
substantial incorporation of counts occurred when GlyCAM-l/IgG (66 kDa) was 
cotransfected with either KSGal6ST or HEC-GlcNAc6ST cDNA but not with the empty 
vector. 

In order to establish the regiochemistry of sulfation on radiolabeled GlyCAM- 
1/IgG, samples resulting from the two sulfotransferase transfections were subjected to 
hydrolysis and analysis by HPAEC, using a Dionex HPLC system according to the 
previously established procedures (Hemmerich et al., Biochem. (1994)33:4820-4829). As 
shown in Fig. 10, the retention times of the released sulfated mono- and disaccharides 
corresponded to those of authentic standards for [S0 3 -6]Gal and 
[S0 3 -6]Gaipi-4GlcNAc for the KSGal6ST transfectants and to [S0 3 -6]GlcNAc and 
Galpl-4[SO r 6]GlcNAc for the HEC-GlcNAc6ST transfectants. Thus, the specificities 
of KSGal6ST and HEC-GlcNAc6ST for oligosaccharides of GlyCAM-1 reflected those 
observed with the model acceptors (Figs. 8 & 9, Fukuta et al., 1997). 



D. Contribution of KSGal6ST and HEC-GlcNAc6ST to the Generation of L-selectin 
Ligand Activity 

In order to test whether KSGal6ST and HEC-GlcNAc6ST can contribute to the 
generation of L-selectin ligand activity, the binding of an L-selectin/IgM chimera to CHO 
cells which were transiently transfected with cDNAs for the sulfotransferases and a cDNA 
encoding CD34 was examined. The recipient CHO cells, termed CHO/FTVII/C2GnT, 
were stably transfected with 1) fucosyltransferase VII (FucTVII), which is known to be 
involved in the biosynthesis of L-selectin ligands (Maly et al., Cell (1996) 643-653); and 
2) core 2 01-6 N-acetylglucosaminyltransferase (C2GnT) (Bierhuizen and Fukuda, Proc. 
Nat'l Acad. Sci. USA (1992)89:9326-9330). This latter enzyme provides a core structure 
for O-linked glycans upon which extended chains with sLe x capping groups are 
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elaborated (Li et al., J. Biol.Chem (1996) 271:3255-3264. Binding was measured by flow 
cytometry. As shown in Fig. 1 la, no L-selectin/IgM staining above background was 
observed in the CHO cell transfectants in the absence of sulfotransferase cDNA. Strong 
L-selectin/IgM binding was observed in a significant population of the cells when both 
HEC-GlcNAc6ST and KSGal6ST cDNAs were included in the transfection. The binding 
of the L-selectin/IgM chimera was specific as indicated by its calcium dependence (data 
not shown) and the inhibition of its binding by a function-blocking anti-L-selectin mAb 
(MEL-14) (Fig. 11a). The binding of L-selectin/IgM was dependent on the presence of 
the CD34 protein scaffold, as indicated by the nearly complete loss of staining when the 
CD34 cDNA was omitted from the transfection (Fig.l lb). To determine the individual 
contribution of the two sulfotransferases to L-selectin binding, we cotransfected the CHO 
cells were cotransfected with cDNAs for 1) CD34 and 2) the sulfotransferases, alone or in 
combination. Transfection with either HEC-GlcNAc6ST or KSGal6ST cDNA conferred 
binding of the L-selectin/IgM chimera (Fig. 1 lc). KSGal6ST appeared to exert the 
greater effect, both in terms of the proportion of positive cells and their mean fluorescence 
intensity (Fig. 1 lc). However, the combination of KSGal6ST and HEC-GlcNAc6ST 
cDNAs strongly enhanced the binding of L-selectin/IgM relative to the the single 
sulfotransferase transfectants (Fig. 1 lc). Moreover, the signal resulting from the 
combination clearly exceeded the sum of the signals from the individual transfections, 
indicating that the two sulfotransferases synergized to generate ligand activity. This 
synergistic effect was evident over a range of cDNA concentrations in the transfection 
mixtures (Table 4). 
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Table 4. L-Selectin/IgM Staining of CHO Cells Transfected with Combinations of 
Sulfotransferase cDNAs 



tig cDNA transfected 

KSGal6ST HEC-GlcNAc6ST 

To 0 

1.5 0 

0 1 

0 1-5 
0.5 0.5 

1 0.5 
0.5 10 



MF1 

L-s el-IgM Staining 
364 
391 
114 
163 
935 
917 
830 



CHO/FTVIUC2GnT cells were cotransfected with plasmids encoding CD34 (2 M g) and each 
erase aione or in combination in the indicated ^J2™3Z5^ 
fluorescence intensity (MFI) of L-selectin/IgM stammg m die L-se 1- ^ jPMp^baon ^ 
background signal (from transfectants with CD34 cDNA alone, value 139) subtracted. 

VIII. Rolling Data in a Parallel Plate Flow Chamber 
A. Methods 

1 . Transient transfection of COS-7 cells 

For generation of recombinant GlyCAM-l/IG fusion protein, COS-7 cells were 
grown to 80% confluency in 10 cm culture dishes (Nunc) and transfected with plasmids 
encoding core 2 p-l^N-acetylglucosminyltransferase (C2Gnt) (l/,g), fucosyltransferase 
VII (FT-VII) (l^g), GlyCAM-l/IgG (2/ig) and 0.5, 1 or 2/ig of either KSGaL6ST, 
huGlcNAc6ST, HEC-GlcNAc6ST, or irrelevant control plasmid (mock control), using 
Lipofectamine (Life Technologies) in Opti-MEM (Life Technologies) according to the 
manufacturer's protocol. Cells were grown for seven days in Opti- 
MEM/penicillin/streptavidin. Recombinant GlyCAM-l/IgG fusion protein was isolated 
from the conditioned medium (CM) by affinity chromatography on protein A-agarose and 
transferred to PBS on a Centricon 30 concentrator (Amicon, Beverly, MA). 



2. Laminar flow assays 

The GlyCAM-l/IgG constructs were coated at similar site densities onto 
polystyrene dishes (Corning, San Mateo, CA) as determined by ELISA using 96 well 
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polystyrene plates (Costar, Corning, NY). Proteins were coated in Tris-buffered saline 
(TBS), pH 9.0 overnight at 4°C, washed and blocked with 3% BSA. The immobilized 
GlyCAM-l/IG constructs were detected by ELISA with biotinylated CAMO-5 or 
biotinylated anti human IgG (Fc specific) and streptavidin conjugated alkaline 
phosphatase. For flow experiments the substrate-coated dishes were incorporated as the 
lower wall of a parallel plate flow chamber (Lawrence et al., Cell (1991) 65:859-873) and 
mounted on the stage of an inverted phase-contrast microscope microscope (Diaphot 
TMD; Nikon Inc., Garden City, NY). Jurkat cells were perfused through the flow 
chamber at l-2xl0 6 cells/ml in HBSS with Ca + or Mg + supplemented with 0.2% BSA. 
For inhibition studies, cells were treated with 5 DREG56 (anti L-selectin mAb), 10 
^g/ml Fucoidin (a carbohydrate inhibitor of L-selectin) for 10 min at room temperature 
prior to injection into the flow chamber or perfused in 5mM EDTA/HBSS/0.2% BSA. In 
some cases, substrates were incubated with 5 mU/ml Vibrio cholera sialidase (Oxford 
Glycosystems, Rosedale, NY) for 30 min in 50 mM sodium acetate, 4 mM CaCl 2 , 0.1% 
BSA, pH 5.5 or as a control with buffer alone. 

Experiments were videotaped using a 4x objective (field of view - 1 .9 1mm 2 ) and 
data were analyzed on a computer using NIH Image 1.61. For tethering, cells were 
perfused through the chamber at different shear stresses covering a range from 3 
dynes/cm 2 to 0.2 dynes/cm 2 . The fraction of cells that came into close proximity with the 
substrate and tethered stably (cells that continued to roll for > Is on the substrate after the 
initial attachment) was determined. For rolling velocities, cells were infused for 2 min at 
1 dyne/cm 2 after which shear stress was increased in 1.5 to twofold increments up to 35 
dynes/cm 2 in intervals of 5 s (detachment assay). Cell displacement was followed for 1 - 
3 seconds to determine rolling velocities at each shear stress. In detachment assays the 
number of rolling cells at each shear stress was determined. The maximum number of 
adherent cells was set to 100%, and the number of rolling cells at higher shear stresses 
was expressed relative to this peak accumulation value. In all experiments the observation 
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field was located at the upstream edge of the spot of adsorbed protein to minimize 
contribution of cells rolling into the fields from upstream fields. 

B. Results 

1. Jurkat cells roll on fucosylated and sulfated GlyCAM-l/IgG in shear flow 

The binding of L-selectin to its HEV-ligands requires sialylation, fucosylation 
(Maly et aL. 1996) and sulfation. Maly et al. established that FucT-VII is the relevant 
fucosyltransferase for the elaboration of functional L-selectin ligands in HEC. Our 
previous structural analysis indicated that Gal-6-sulfate and GlcNAc-6-sulfate are equally 
present in native GlyCAM-1 oligosaccharides and that both reside within the sLe* -based 
capping structures (6'-sulfo sLe* and 6-sulfo s sLe*, respectively). In order to test how 
these sulfation modifications contribute to the generation of L-selectin ligand activity the 
rolling of Jurkat cells on immobilized, fucosylated, and sulfated rGlyCAM-l/IgG fusion 
proteins under shear flow conditions was examined. COS cells, which have endogenous 
sialyl transferase activity, were cotransfected with cDNAs for C2GnT, FucT-VII, 
GlyCAM-l/IgG and either KSGal6ST, huGlcNAc6ST, HEC-GLCNAC6ST or control 
plasmid. Recombinant GlyCAM-l/IgG fusion proteins were purified from the 
conditioned medium on protein A-agarose. 

For flow assays the recombinant proteins were coated at equal site densities as 
determined by ELISA (data not shown). Detection of the immobilized GlyCAM-l/IgG 
substrates with the anti-GlyCAM-1 antibody CAMO-5 or an anti-human IgG (Fc specific) 
mAb demonstrated the same level of binding of the various chimeras. Rolling of Jurkat 
cells on the substrates at a shear stress of 1 dyne/cm 2 required the fucosylation of 
GlyCAM-l/IgG, as no rolling interactions were observed on GlyCAM-l/IgG or sulfated, 
non-fucosylated GlyCAM-l/IgG (Table 5). Sulfation of C-6 of either Gal or GlcNAc 
yielded an increase in the number of rolling cells, however the effect was more 
pronounced for the Gal-6-sulfated substrate. An anti-L-selectin mAb abrogated the 
interaction with the substrate as did fucoidin, an anionic polysaccharide that blocks L- 
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selectin-dependent binding. Treatment of the substrates with sialidase from Vibrio 
cholera completely prevented Jurkat cell tethering as previously seen for lymphocyte 
tethering onto PNAd and CD34 (Puri et ah, 1995) or HEV (Rosen et al., 1985) (Table 5). 



Table 5. 

5 Characteristics of the interaction of Jurkat cells with rGlyCAM-l/IgG fusion proteins 







substrate 


Rollins' 


EDTA 2 anti L-selectin mAb 2 Fucoidin 2 


sialidase 1 






rGlyCAM-l/IgG 


0 








10 


rGlyCAM-i/IgG 
FT 


12.2±5 


ooo 


0 






rGlyCAM-l/IgG 
KSGal6ST 


0 








15 


rGlyCAM-l/IgG 
huGlcNAc6ST 


0 






b 

0 


20 


rGlyCAM-l/IgG 
HEC-GlcNAc6ST 


0 






iy 

iJi 5 

m 


rGlyCAM-l/IgG 
FT 

KSGal6ST 


107.0±16.7 






O 


25 


rGlyCAM-l/IgG 
FT 

huGlcNAc6ST 


36.1±15.3 






0 


30 


rGlyCAM-l/IgG 
FT 

HEC-GlcNAc6ST 


36.d±9.5 







1 Jurkat cells were perfused through the flow chamber at 1 dyne/cm 2 at l-2xl0 6 cells/ml and the 
number of rolling cells was evaluated at 2 min of flow. 
35 2 Cells were treated with 5 //g/ml DREG56, 1 0 /ig/ml Fucoidin for 1 0 min at room temperature 
prior to injection into the flow chamber or perfused in 5mM EDTA/HBSS/0.2% BSA to inhibit L-selectin 
mediated rolling. 

3 Immobilized GlyCAM-l/Ig fusion proteins were treated with 5 mU/ml Vibrio cholera sialidase a 
describes in Methods. 
40 4 '0* indicates that no rolling of cells was detected. 

2. Sulfation of fucosylated rGfyCAM-1 enhances L-selectin mediated 
interactions in shear flow 
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To determine the contribution of sulfation to tethering under laminar flow 
conditions, Jurkat cells were perfused through a parallel wall flow chamber in which 
fucosylated GlyCAM-l/IgG constructs with or without sulfate modifications were coated 
at the same site densities. Jurkat cells showed the tethering profile characteristic for L- 
selectin with a shear threshold below which no or little tethering occurred (Fig.12). While 
a maximum tethering rate of 9% was found for the interaction of cells with non-sulfated 
GlyCAM-l/IgG (FT), the fraction of tethered cells was doubled upon sulfation on C-6 of 
GlcNAc (FT, huGlcNAc6ST; FT, HEC-GlcNac6ST) and increased sixfold upon sulfation 
C-6 of Gal (FT, KSGal6ST). Furthermore, the latter modification resulted also in a shift 
of the tethering threshold towards lower shear stresses (Fig.12). 

Sulfation of C-6 of GlcNAc reduced the velocity of rolling cells consistently 
relative to the velocities observed on non-sulfated GlyCAM-l/IG. The reduction in 
velocity was more pronounced on the Gal-6 sulfated substrate than on the substrates with 
GlcNAc-6 modifications (Fig.13). Gal-6 sulfation resulted in cells that were more 
resistant to detachment at increased shear stresses indicating an increase in binding 
strength (Fig. 14). Small increases in binding strength were observed on GlcNAc-6 
modified substrates: 

It is apparent from the above results and discussion that a novel human glycosyl 
sulfotransferase, as well as polypeptides related thereto and nucleic acid compositions 
encoding the same are provided by the subject invention. These polypeptide and nucleic 
acid compositions find use in a variety of diverse applications, including research, 
diagnostic, screening and therapeutic applications. Also provided are improved methods 
of treating diseases associated with selectin-sulfated ligand mediated binding events, since 
agents that selectively reduce or inhibit the activity of the subject enzyme are employed, 
so that other sulfotransferases whose activity is beneficial are not adversely affected. 
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All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of 
any publication is for its disclosure prior to the filing date and should not be construed as 
5 an admission that the present invention is not entitled to antedate such publication by 
virtue of prior invention. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it is readily apparent to 
1 0 those of ordinary skill in the art in light of the teachings of this invention that certain 

changes and modifications may be made thereto without departing from the spirit or scope 
of the appended claims. 
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